Foreword to Earlier Series Editions


More than a generation of German-speaking students around the world have 
worked their way to an understanding and appreciation of the power and 
beauty of modern theoretical physics — with mathematics, the most fundamental 
of sciences — using Walter Greiner’s textbooks as their guide. 

The idea of developing a coherent, complete presentation of an entire field of 
science in a series of closely related textbooks is not a new one. Many older 
physicists remember with real pleasure their sense of adventure and discovery as 
they worked their ways through the classic series by Sommerfeld, by Planck and 
by Landau and Lifshitz. From the students’ viewpoint, there are a great many 
obvious advantages to be gained through use of consistent notation, logical 
ordering of topics and coherence of presentation; beyond this, the complete 
coverage of the science provides a unique opportunity for the author to convey 
his personal enthusiasm and love for his subject. 

The present five volume set, Theoretical Physics, is in fact only that part of the 
complete set of textbooks developed by Greiner and his students that presents 
the quantum theory. I have long urged him to make the remaining volumes on 
classical mechanics and dynamics, on electromagnetism, on nuclear and particle 
physics, and on special topics available to an English-speaking audience as well, 
and we can hope for these companion volumes covering all of theoretical physics 
some time in the future. 

What makes Greiner’s volumes of particular value to the student and 
professor alike is their completeness. Greiner avoids the all too common “it 
follows that...” which conceals several pages of mathematical manipulation and 
confounds the student. He does not hesitate to include experimental data to 
illuminate or illustrate a theoretical point and these data, like the theoretical 
content, have been kept up to date and topical through frequent revision and 
expansion of the lecture notes upon which these volumes are based. 

Moreover, Greiner greatly increases the value of his presentation by including
something like one hundred completely worked examples in each volume.
Nothing is of greater importance to the student than seeing, in detail, how the 
theoretical concepts and tools under study are applied to actual problems of 
interest to a working physicist. And, finally, Greiner adds brief biographical 
sketches to each chapter covering the people responsible for the development of 
the theoretical ideas and/or the experimental data presented. It was Auguste 
Comte (1798-1857) in his Positive Philosophy who noted, “To understand a 
science it is necessary to know its history”. This is all too often forgotten in 
modern physics teaching and the bridges that Greiner builds to the pioneering
figures of our science upon whose work we build are welcome ones. 

Greiner’s lectures, which underlie these volumes, are internationally noted 
for their clarity, their completeness and for the effort that he has devoted to 
making physics an integral whole; his enthusiasm for his science is contagious 
and shines through almost every page. 

These volumes represent only a part of a unique and Herculean effort to 
make all of theoretical physics accessible to the interested student. Beyond that, 
they are of enormous value to the professional physicist and to all others 
working with quantum phenomena. Again and again the reader will find that, 
after dipping into a particular volume to review a specific topic, he will end up 
browsing, caught up by often fascinating new insights and developments with 
which he had not previously been familiar. 

Having used a number of Greiner’s volumes in their original German in 
my teaching and research at Yale, I welcome these new and revised English 
translations and would recommend them enthusiastically to anyone searching 
for a coherent overview of physics. 


Yale University D. Allan Bromley 
New Haven, CT, USA Henry Ford II Professor of Physics 
1989 


Preface to the Third Edition 


The text Quantum Mechanics — An Introduction has found many friends among 
physics students and researchers so that the need for a third edition has arisen. 
There was no need for a major revision of the text but I have taken the 
opportunity to make several amendments and improvements. A number of
misprints and minor errors have been corrected and a few clarifying remarks 
have been added at various places. A few figures have been added or revised, in 
particular the three-dimensional density plots in Chap. 9. 

I am grateful to several colleagues for helpful comments, in particular to 
Prof. R.A. King (Calgary) who supplied a comprehensive list of corrections. I 
also thank Dr. A. Scherdin for help with the figures and Dr. R. Mattiello who 
has supervised the preparation of the third edition of the book. Furthermore 
I acknowledge the agreeable collaboration with Dr. H. J. Kölsch and his team at
Springer-Verlag, Heidelberg. 


Frankfurt am Main Walter Greiner 
July 1994 


Preface to the Second Edition 


Like its German companion, the English edition of our textbook series has also 
found many friends, so that it has become necessary to prepare a second edition 
of this volume. There was no need for a major revision of the text. However, 
I have taken the opportunity to make several minor changes and to correct a 
number of misprints. Thanks are due to those colleagues and students who 
made suggestions to improve the text. I am confident that this textbook will
continue to serve as a useful introduction to the fascinating topic of quantum 
mechanics. 


Frankfurt am Main, Walter Greiner 
November 1992 


Preface to the First Edition 


Quantum Mechanics — An Introduction contains the lectures that form part of the 
course of study in theoretical physics at the Johann Wolfgang Goethe University
in Frankfurt. There they are given for students in physics and mathematics
in their fourth semester. They are preceded by Theoretical Mechanics I (in the 
first semester), Theoretical Mechanics II (in the second semester), and Classical 
Electrodynamics (in the third semester). Quantum Mechanics I — An Introduc- 
tion then concludes the foundations laid for our students of the mathematical 
and physical methods of theoretical physics. Graduate work begins with the 
courses Thermodynamics and Statistical Mechanics, Quantum Mechanics II — 
Symmetries, Relativistic Quantum Mechanics, Quantum Electrodynamics, the 
Gauge Theory of Weak Interactions, Quantum Chromodynamics, and other, 
more specialized lectures. 

As in all the other fields mentioned, we present quantum mechanics according
to the inductive method, which comes closest to the methodology of the
research physicist: starting with some key experiments, which are idealized, 
the basic ideas of the new science are introduced step by step. In this book, 
for example, we present the concepts of “state of a system” and “eigenstate”, 
which then straightforwardly lead to the basic equation of motion, i.e. to the 
Schrödinger equation; and, by way of a number of classic, historically important
observations concerning the quantization of physical systems and the various 
radiation laws, we infer the duality of waves and particles, which we understand 
with Max Born’s conception of a “guiding field”. 

Quantum mechanics is then further developed with respect to fundamental 
problems (uncertainty relations; many-body systems; quantization of classical 
systems; spin within the phenomenological Pauli theory and through linearization
of wave equations; etc.), applications (harmonic oscillator; hydrogen
atom; Stern-Gerlach, Einstein-de Haas, Franck-Hertz, and Rabi experiments),
and its mathematical structure (elements of representation theory; introduction
of the S matrix, of Heisenberg, Schrödinger, and interaction pictures; eigendifferentials
and the normalization of continuum wave functions; perturbation
theory; etc.). Also, the elements of angular-momentum algebra are explained,
which are so essential in many applications of atomic and nuclear physics. These 
will be presented in a much broader theoretical context in Quantum Mechanics — 
Symmetries. 

Obviously an introductory course on quantum theory cannot (and should 
not) cover the whole field. Our selection of problems was carried out according 
to their physical importance, their pedagogical value, and their historical impact 
on the development of the field. 

Students profit in the fourth semester at Frankfurt from the solid mathematical
education of the first two years of studies. Nevertheless, in these lectures,
new mathematical tools and methods and their use have also to be discussed. 
Within this category belong the solution of special differential equations 
(especially of the hypergeometrical and confluent hypergeometrical differential 
equations), a reminder of the elements of matrix calculus, the formulation of 
eigenvalue problems, and the explanation of (simple) perturbation methods. As 
in all the lectures, this is done in close connection with the physical problems 
encountered. In this way the student gets a feeling for the practical usefulness of 
the mathematical methods. Very many worked examples and exercises illustrate 
and round off the new physics and mathematics. 

Furthermore, biographical and historical footnotes anchor the scientific 
development to the general side of human progress and evolution. In this 
context I thank the publishers Harri Deutsch and F.A. Brockhaus (Brockhaus 
Enzyklopädie, F.A. Brockhaus, Wiesbaden, marked by BR) for giving permission
to extract the biographical data of physicists and mathematicians from
their publications. 

The lectures are now in their 5th German edition. Over the years many
students and collaborators have helped to work out exercises and illustrative 
examples. For the first English edition I enjoyed the help of Maria Berenguer, 
Snjezana Butorac, Christian Derreth, Dr. Klaus Geiger, Dr. Matthias Grabiak, 
Carsten Greiner, Christoph Hartnack, Dr. Richard Hermann, Raffaele Mattiello,
Dieter Neubauer, Jochen Rau, Wolfgang Renner, Dirk Rischke, Thomas
Schönfeld, and Dr. Stefan Schramm. Miss Astrid Steidl drew the graphs and
pictures. To all of them I express my sincere thanks. 

I would especially like to thank Mr. Béla Waldhauser, Dipl.-Phys., for his 
overall assistance. His organizational talent and his advice in technical matters 
are very much appreciated. 

Finally, I wish to thank Springer-Verlag; in particular, Dr. H.-U. Daniel, for 
his encouragement and patience, and Mr. Mark Seymour, for his expertise in 
copy-editing the English edition. 


Frankfurt am Main Walter Greiner 
July 1989 


1. The Quantization of Physical Quantities 


1.1 Light Quanta 


In order to explain physical phenomena caused by light, two points of view have 
emerged, each of which has its place in the history of physics. Almost simultaneously,
in the second half of the seventeenth century, the corpuscle theory was
developed by Newton and the wave theory of light was created by Huygens. 
Some basic properties like the rectilinear propagation and reflection of light can 
be explained by both theories, but other phenomena, such as interference, the 
fact that light plus light may cause darkness, can be explained only by the wave 
theory. 

The success of Maxwell's electrodynamics in the nineteenth century, which 
interprets light as electromagnetic waves, seemed finally to confirm the wave 
theory. Then, with the discovery of the photoelectric effect by Heinrich Hertz 
in 1887, a development began which led ultimately to the view that light has to 
be described by particles or waves, depending on the specific problem or kind 
of experiment considered. The “particles” of light are called quanta of light or
photons; the coexistence of waves and particles is called wave-particle duality.

In the following we shall discuss some experiments which can be explained 
only by the existence of light quanta. 


1.2 The Photoelectric Effect 


The ejection of electrons from a metal surface by light is called the photoelectric
effect. An experiment by Philipp Lenard showed that the energy of the detached
electrons is given by the frequency of the irradiating light (Fig. 1.1).

Fig. 1.1. Measurement of the photoelectric effect: light shines on a metal, thereby liberating electrons ($e^-$)

Fig. 1.2. The linear increase of photoelectron energy with frequency $\omega$ of the incident light

Fig. 1.3. Conservation of momentum in Compton scattering

Monochromatic light yields electrons of a definite energy. An increase in
light intensity leads to the emission of more electrons, but does not change their
energy. This is in clear contradiction to classical wave theory, where the energy
of a wave is given by its intensity. If we carry out the experiment with monochromatic
light of different frequencies, a linear dependence between energy and
frequency is obtained, as shown in Fig. 1.2:


$$E \propto (a + b\omega) \, . \tag{1.1}$$


The proportionality factor, i.e. the slope of the straight line, is found to be
Planck’s constant $h$ divided by $2\pi$, so that


$$E = \hbar(\omega - \omega_a) = h(\nu - \nu_a) \tag{1.2}$$


with $\hbar = h/2\pi = 1.05 \times 10^{-34}$ J s.

Einstein interpreted this effect by postulating discrete quanta of light 
(photons) with energy $\hbar\omega$. Increasing the intensity of the light beam also
increases the number of photons, which can break off correspondingly more 
electrons from the metal. 

In these experiments, a frequency limit $\omega_a$ appears, which depends on the
kind of metal. Below this frequency limit, no electrons can leave the metal. This
means that a definite escape energy $\hbar\omega_a$ is needed to raise electrons from the
surface of the metal. 
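
As a numerical illustration of (1.2), the following minimal Python sketch evaluates the photoelectron energy for light of several wavelengths. The threshold wavelength of 540 nm is a hypothetical value chosen only for illustration and is not taken from the text; the function and variable names are likewise our own.

```python
import math

H_BAR = 1.054571817e-34     # reduced Planck constant in J s
C = 2.99792458e8            # speed of light in m/s
E_CHARGE = 1.602176634e-19  # joules per electron-volt


def angular_frequency(wavelength_m):
    """Angular frequency omega = 2*pi*c/lambda of light with the given wavelength."""
    return 2.0 * math.pi * C / wavelength_m


def photoelectron_energy_ev(omega, omega_a):
    """Photoelectron energy in eV from (1.2); zero below the frequency limit."""
    if omega <= omega_a:
        return 0.0  # below the frequency limit no electrons leave the metal
    return H_BAR * (omega - omega_a) / E_CHARGE


# Hypothetical frequency limit (assumption): threshold wavelength of 540 nm.
OMEGA_A = angular_frequency(540e-9)

for wavelength_nm in (700, 540, 400, 250):
    omega = angular_frequency(wavelength_nm * 1e-9)
    energy = photoelectron_energy_ev(omega, OMEGA_A)
    print(f"{wavelength_nm} nm: E = {energy:.3f} eV")
```

Note that the light intensity does not enter the function at all: in the photon picture it only determines how many electrons are emitted, not their energy.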

The light quantum that has to be postulated to understand the photoelectric 
effect moves with the velocity of light. Hence it follows from Einstein’s Theory of 
Relativity that the rest mass of the photon is equal to zero. 

If we set the rest mass equal to zero in the general relation for the total 
energy 


$$E^2 = (m_0 c^2)^2 + p^2 c^2 = \hbar^2 \omega^2 \tag{1.3}$$


and express the frequency by the wave number $k = \omega/c$, the momentum of the
photon follows as


$$p = \hbar k = \hbar\omega/c \, , \tag{1.4}$$


or written as a vector identity, assuming that the direction of the momentum of
the photon should correspond to the propagation direction of the light wave,


$$\boldsymbol{p} = \hbar\boldsymbol{k} \, . \tag{1.5}$$


1.3 The Compton Effect 


When X-rays are scattered by electrons, a shift in frequency can be observed, the 
amount of this shift depending on the scattering angle. This effect was discovered
by Compton in 1923 and explained on the basis of the photon picture
simultaneously by Compton himself and Debye. 

Figure 1.3 illustrates the kinematical situation. We assume the electron is 
unbound and at rest before the collision. Then the conservation of energy and 
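momentum reads:


$$\hbar\omega + m_0 c^2 = \hbar\omega' + \frac{m_0 c^2}{\sqrt{1-\beta^2}} \, , \tag{1.6}$$

$$\hbar\boldsymbol{k} = \hbar\boldsymbol{k}' + \frac{m_0 \boldsymbol{v}}{\sqrt{1-\beta^2}} \, . \tag{1.7}$$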


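To obtain a relation between the scattering angle $\vartheta$ and the frequency shift, we
divide (1.7) into components parallel and perpendicular to the direction of incidence.
This yields, with $k = \omega/c$ and with $\varphi$ denoting the angle between the recoiling
electron and the direction of incidence,


$$\frac{\hbar\omega}{c} = \frac{\hbar\omega'}{c}\cos\vartheta + \frac{m_0 v}{\sqrt{1-\beta^2}}\cos\varphi \, , \tag{1.8}$$

$$\frac{\hbar\omega'}{c}\sin\vartheta = \frac{m_0 v}{\sqrt{1-\beta^2}}\sin\varphi \, . \tag{1.9}$$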


From these two component equations, we can first eliminate $\varphi$ and then, by (1.6),
the electron velocity $v$ ($\beta = v/c$). Hence for the frequency difference we have


$$\omega - \omega' = \frac{2\hbar}{m_0 c^2}\,\omega\omega' \sin^2\frac{\vartheta}{2} \, . \tag{1.10}$$


If we put $\omega = 2\pi c/\lambda$, we obtain the Compton scattering formula in the usual form
with the difference in wavelength as a function of the scattering angle $\vartheta$:

$$\lambda' - \lambda = 4\pi\frac{\hbar}{m_0 c}\sin^2\frac{\vartheta}{2} \, . \tag{1.11}$$

The scattering formula shows that the change in wavelength depends only on
the scattering angle $\vartheta$. During the collision the photon loses a part of its energy
and the wavelength increases ($\lambda' > \lambda$).

The factor $2\pi\hbar/m_0 c$ is called the Compton wavelength $\lambda_C$ of a particle with rest
mass $m_0$ (here, an electron). The Compton wavelength can be used as a measure
of the size of a particle. The kinetic energy of the scattered electron is then


$$T = \hbar\omega - \hbar\omega' = 2\pi\hbar c\left(\frac{1}{\lambda} - \frac{1}{\lambda'}\right) \, , \tag{1.12}$$

or (see Fig. 1.4)

$$T = \hbar\omega\,\frac{2\lambda_C \sin^2(\vartheta/2)}{\lambda + 2\lambda_C \sin^2(\vartheta/2)} \, . \tag{1.13}$$


Thus the energy of the scattered electron is directly proportional to the energy of
the photon. Therefore the Compton effect can only be observed in the domain of
short wavelengths (X-rays and $\gamma$-rays). To appreciate this observation fully, we
have to remember that in classical electrodynamics, no alteration in frequency is
permitted in the scattering of electromagnetic waves; only light quanta with
momentum $\hbar k$ and energy $\hbar\omega$ make this possible. Thus the idea of light quanta
has been experimentally confirmed by the Compton effect. A relatively broad
Compton line appears in the experiment, due to certain momentum distributions
of the electrons and because the electrons are bound in atoms.

Fig. 1.4. The Compton effect: energy distribution of photons and electrons, showing dependence on the scattering angle

Fig. 1.5. The Ritz combination principle

Fig. 1.6. Current (J)-voltage (eV) characteristics for the Franck-Hertz experiment showing regular maxima and minima

The Compton effect is a further proof for the concept of photons and for the 
validity of momentum and energy conservation in interaction processes between 
light and matter. 
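
To get a feeling for the orders of magnitude, the scattering formula (1.11) and the electron energy (1.13) can be evaluated numerically. The following Python sketch is only an illustration: the incident wavelength of 0.071 nm is an assumed value typical of X-rays, the constants are standard reference values, and all names are our own.

```python
import math

H = 6.62607015e-34          # Planck constant h = 2*pi*hbar in J s
C = 2.99792458e8            # speed of light in m/s
M_E = 9.1093837015e-31      # electron rest mass m_0 in kg
E_CHARGE = 1.602176634e-19  # joules per electron-volt

# Compton wavelength of the electron, lambda_C = 2*pi*hbar/(m_0 c) = h/(m_0 c)
LAMBDA_C = H / (M_E * C)


def compton_shift(theta):
    """Wavelength shift lambda' - lambda from (1.11), written as 2*lambda_C*sin^2(theta/2)."""
    return 2.0 * LAMBDA_C * math.sin(theta / 2.0) ** 2


def electron_kinetic_energy(wavelength, theta):
    """Kinetic energy T of the scattered electron from (1.13), in joules."""
    shift = compton_shift(theta)
    photon_energy = H * C / wavelength  # hbar*omega of the incident photon
    return photon_energy * shift / (wavelength + shift)


# Assumed incident X-ray wavelength (illustrative value, not from the text).
wavelength = 0.071e-9

for degrees in (0, 45, 90, 135, 180):
    theta = math.radians(degrees)
    shift_pm = compton_shift(theta) * 1e12
    t_ev = electron_kinetic_energy(wavelength, theta) / E_CHARGE
    print(f"theta = {degrees:3d} deg: shift = {shift_pm:.3f} pm, T = {t_ev:.1f} eV")
```

The shift never exceeds $2\lambda_C$, a few picometres for the electron, which is why the effect is noticeable only when the incident wavelength is itself short.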


1.4 The Ritz Combination Principle 


In the course of investigations of the radiation emitted by atoms, it appeared 
that characteristic spectral lines belong to each atom and that these lines can be 
formally arranged into certain spectral series (for example, the Balmer series in 
the hydrogen atom). The Ritz combination principle (1908) states that new 
spectral lines can be found by additive and subtractive combination of two 
known spectral lines. The existence of spectral lines means that transitions 
(of the electrons) between discrete energy levels take place within the atom. 

The frequency condition $E = \hbar\omega$ yields an explanation for the Ritz combination
principle. Considering the transition of an atom from a state with energy $E_n$
to a state with energy $E_m$ (Fig. 1.5), we have


$$\hbar\omega_{nm} = E_n - E_m = E_n - E_i + E_i - E_m \tag{1.14}$$

or for the frequencies,

$$\omega_{nm} = \omega_{ni} + \omega_{im} \, . \tag{1.15}$$


In Fig. 1.5, the energy levels and the corresponding transitions are
represented schematically. The spectral series result from transitions from different
higher energy levels into a common “ground state” $E_m$. Thus, the spectral
analysis of atoms suggests quite clearly that only discrete energy levels exist in 
an atom and that energy can be transferred only by light quanta with a definite 
energy. 
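
The combination principle can be checked on a concrete example. The sketch below assumes the hydrogen level formula $E_n = -13.6\,\mathrm{eV}/n^2$, which has not been derived at this point in the text and is used here purely as an illustrative input; the function names are our own. It shows that the first Balmer line (3 → 2) and the first Lyman line (2 → 1) combine additively to give the second Lyman line (3 → 1).

```python
import math

# Assumption for illustration: hydrogen levels E_n = -RYDBERG_EV / n^2.
RYDBERG_EV = 13.605693


def level_energy(n):
    """Energy of the hydrogen level with principal quantum number n, in eV."""
    return -RYDBERG_EV / n ** 2


def transition_energy(n_upper, n_lower):
    """Photon energy hbar*omega for the transition n_upper -> n_lower, in eV."""
    return level_energy(n_upper) - level_energy(n_lower)


balmer_alpha = transition_energy(3, 2)  # first line of the Balmer series
lyman_alpha = transition_energy(2, 1)   # first line of the Lyman series
lyman_beta = transition_energy(3, 1)    # second line of the Lyman series

print(f"Balmer alpha: {balmer_alpha:.4f} eV")
print(f"Lyman alpha:  {lyman_alpha:.4f} eV")
print(f"Lyman beta:   {lyman_beta:.4f} eV")
print(f"Sum of the first two: {balmer_alpha + lyman_alpha:.4f} eV")
```

Because each line is a difference of two level energies, the intermediate level drops out of the sum, exactly as in (1.14).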


1.5 The Franck-Hertz Experiment 


Another experiment demonstrating the quantization of energy was performed 
by Franck and Hertz in 1913, using a triode filled with mercury vapour. The 
triode consists of an axial cathode K in a cylindrical grid A closely surrounded 
by a third electrode Z. The electrons are accelerated between K and A to reach 
Z through the anode grid. A small countervoltage prevents very slow electrons 
from reaching Z. The experiment yields a current-voltage characteristic between 
K and Z as shown in Fig. 1.6. 

As long as the energy of the electron in the field does not exceed 4.9 eV, the
electrons can cross the tube without a loss of energy. The exchange of energy due
to elastic collisions between electrons and mercury atoms can be neglected. The
current increases steadily, but as soon as the energy of the electrons has reached
4.9 eV, the current drops drastically. A mercury atom obviously can take up
exactly this much energy from the electrons in a collision. Thereafter an electron
has insufficient kinetic energy to reach the second anode Z and the atom emits
this energy with the characteristic wavelength of $\lambda = 2537$ Å. On increasing the
voltage further, the electrons can regain kinetic energy and the process repeats
itself.

The Franck-Hertz experiment shows the existence of discrete energy levels 
(quantization of energy) in the mercury atom. 
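
The two numbers quoted above, 4.9 eV and 2537 Å, can be compared using the photon relation $E = \hbar\omega = hc/\lambda$. The short Python sketch below is a consistency check only; the value of $hc$ in eV Å is a standard reference value and the function names are our own.

```python
HC_EV_ANGSTROM = 12398.42  # product h*c in units of eV * Angstrom


def photon_wavelength_angstrom(energy_ev):
    """Wavelength in Angstrom of a photon with the given energy in eV."""
    return HC_EV_ANGSTROM / energy_ev


def photon_energy_ev(wavelength_angstrom):
    """Energy in eV of a photon with the given wavelength in Angstrom."""
    return HC_EV_ANGSTROM / wavelength_angstrom


print(f"4.9 eV corresponds to {photon_wavelength_angstrom(4.9):.0f} Angstrom")
print(f"2537 Angstrom corresponds to {photon_energy_ev(2537.0):.3f} eV")
```

Within the rounding of the excitation energy to two digits, the energy lost by the electrons agrees with the energy of the emitted ultraviolet light quantum.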


1.6 The Stern-Gerlach Experiment 


In their experiment performed in 1921, Stern and Gerlach observed the splitting 
of an atomic beam in an inhomogeneous magnetic field. If an atom possesses 
a magnetic moment m, it will be affected not only by a torque, but also by a force 
F when in an inhomogeneous magnetic field H. The potential energy in the 
magnetic field is given by $V = -\boldsymbol{m}\cdot\boldsymbol{H}$; the force is given by the gradient, i.e.
$\boldsymbol{F} = -\mathrm{grad}\, V = \mathrm{grad}\,(\boldsymbol{m}\cdot\boldsymbol{H})$.

In the experiment, a beam of neutral silver atoms was sent through an 
inhomogeneous magnetic field and the distribution of the atoms after passing 
the field was measured (for a detailed discussion see page 231 ff.). Classically one 
would expect a broadening of the beam, due to the varying strength of the 
magnetic field. In practice, however, the beam is split into two distinct partial 
beams. The intensity distribution on the screen is shown qualitatively in Fig. 1.7. 

This doubly peaked distribution obviously means that the magnetic moment
of the silver atoms cannot orient itself arbitrarily with respect to the magnetic
field; rather, only two opposing orientations of the magnetic moment in the field
are possible. This cannot be understood classically. Obviously the phenomena
of quantization appearing in the atomic domain are not restricted to energy and
momentum only, but are also found in other physical quantities. This particular
quantization is called directional quantization (or quantization of the angular
momentum, see page 78).

Fig. 1.7. Intensity distributions of the Ag atoms after their transition through an inhomogeneous magnetic field


1.7 Biographical Notes


HERTZ, Heinrich Rudolf, German physicist, *Hamburg 22.2.1857, †Bonn 1.1.1894,
a professor of physics in Karlsruhe and Bonn, confirmed with his experiments concerning
the propagation of electromagnetic waves the predictions of Maxwell's electromagnetic
theory of light in 1887/88. He discovered the so-called Hertz waves, which are the
physical fundamentals of modern radio engineering. He proved the influence of ultraviolet
light on electrical discharge (1887), which led to the discovery of the photoelectric
effect by W. Hallwachs. In 1892 H. observed the transmission of cathode rays through
thin metal plates and gave P. Lenard the task of explaining their nature. H. also gave an
exact definition of hardness.


LENARD, Philipp, German physicist, *Preßburg 7.6.1862, †Messelhausen (Baden-Württ.)
20.5.1947, student of H. Hertz, was a professor in Breslau, Aachen, Kiel and
Heidelberg. Using the window tube suggested by Hertz, L. was the first to investigate 
cathode rays as free electrons independent of the way they were generated and made 
a major contribution to the explanation of the nature of these rays. Among other things 
he showed that the rate of absorption of cathode rays is nearly proportional to the mass 
of the radiated substance. Furthermore he demonstrated that the velocity of electrons 
emitted due to the photoelectric effect is independent of the intensity of light, but depends 
on its frequency. Thus he created the experimental foundation for the fundamental 
photoelectric law formulated by Einstein. Of equal importance was his verification that 
the active centre of an atom is concentrated in a nucleus, which is tiny in comparison with 
the radial dimension of the whole atom. Later this fact was also experimentally proved by 
E. Rutherford. The explanation of the mechanism of phosphorescence and the proof that 
an electron must have a definite minimum energy to ionize an atom are further achievements
by L. He also introduced the “electron-volt” (eV) as a unit of measurement. In 1905
he received the Nobel Prize in Physics. L. was as renowned an experimental physicist as 
his contemporaries J.J. Thomson and E. Rutherford, but was sceptical of Einstein’s 
Special Theory of Relativity. He rejected the Weimar Republic and gradually developed 
into a fanatical anti-semite and national socialist. [BR] 


EINSTEIN, Albert, German physicist, *Ulm 14.3.1879, †Princeton (N.J.) 18.4.1955.
Having grown up in Munich, he moved to Switzerland at the age of 15. As a “technical 
expert third class” at the patent office in Bern, he published in 1905 in Vol. 17 of Annalen 
der Physik three most important papers. In his “On the theory of Brownian motion” he 
published a direct and conclusive proof, based on a purely classical picture, of the 
atomistic structure of matter. In his paper, “On the electrodynamics of moving bodies”, 
he set up with his profound analysis of the terms “space” and “time” the Special Theory of 
Relativity. From this he concluded a few months later the general equivalence of mass 
and energy, expressed by the famous formula $E = mc^2$. In his third article, E. extended
the quantum approach of M. Planck (1900) in “On a heuristic viewpoint concerning the 
production and transformation of light” and made the second decisive step towards the 
development of quantum theory, directly leading to the idea of the duality of particles 
and waves. The concept of light quanta was considered too radical by most physicists and 
was very sceptically received. A change in the opinion of physicists did not take place 
until Niels Bohr proposed his theory of atoms (1913). E., who became a professor at the 
University of Zürich in 1909, went to Prague in 1911, and returned to Zürich a year later
where he joined the Eidgenössische Technische Hochschule. In 1913 he was called to
Berlin as a full-time member of the Preussische Akademie der Wissenschaften and
director of the Kaiser-Wilhelm-Institut für Physik. In 1914/15 he developed the General
Theory of Relativity, starting from the strict proportionality of gravitational and inertial 
mass. As a result of the successful testing of his theory by a British solar eclipse 
expedition, E. became well known to the general public. His political and scientific 
opponents tried unsuccessfully to start a campaign against him and his theory of 
relativity. The Nobel Prize Committee therefore considered it advisable to award E. the 
1921 Nobel Prize in Physics not for his theory of relativity, but for his contributions to 
quantum theory. Beginning in 1921 E. tried to set up his unified theory of matter which 
aimed to incorporate electrodynamics as well as gravitation. Even after it had been 
shown by H. Yukawa that other forces exist besides gravitation and electrodynamics, he
continued with his efforts which, however, remained unsuccessful. Although he published
a paper in 1917 which was instrumental to the statistical interpretation of quantum
theory, he later raised severe objections, based on his philosophical point of view, to the
“Copenhagen Interpretation” proposed by N. Bohr and W. Heisenberg. Several attacks
because of his Jewish background caused E. in 1933 to relinquish all the academic
positions he held in Germany; at the Institute for Advanced Study in Princeton, in the
U.S.A., he found somewhere new to continue his studies. The final stage of E.’s life was
overshadowed by the fact that although a life-long pacifist, fearing German aggression he
initiated the development of the American atomic bomb by writing, together with others,
to President Roosevelt on 2.8.1939. [BR]


COMPTON, Arthur Holly, American physicist, *Wooster (Ohio) 10.9.1892, †Berkeley
(CA) 15.3.1962, became a professor at Washington University, St. Louis, in 1920 and 
at the University of Chicago in 1923. In 1945 he became chancellor of Washington 
University. In the course of his investigations on X-rays he discovered the Compton effect 
in 1922. He and Debye simultaneously gave the quantum-theoretical explanation for this 
effect. C. was also the first to prove the total reflection of X-rays. Together with R.L. 
Doan, he achieved the diffraction of X-rays from a diffraction grating. Jointly with C.T.R. 
Wilson he was awarded the Nobel Prize in Physics in 1927. In cooperation with his 
students C. carried out extensive investigations on cosmic rays. During the Second World 
War he participated in the development of the atomic bomb and radar as director of the 
plutonium research project of the American Government. [BR] 


DEBYE, Petrus Josephus Wilhelmus, Dutch physicist, naturalized in America in 1946,
*Maastricht (Netherlands) 24.3.1884, †Ithaca (N.Y.) 2.11.1966, was called “the Master of
the Molecule”. In 1911 he became a professor at the University of Zürich as successor to
A. Einstein, then in Utrecht (1912-1914), Göttingen (1914-1920), at the Eidgenössische
Technische Hochschule in Zürich (1920-1927), in Leipzig (1927-1935), and was director
of the Kaiser-Wilhelm-Institut für Physik in Berlin, 1935-1939. In 1940 he emigrated to
the United States and became a professor of chemistry at Cornell University (Ithaca) in 
1948. There he directed the chemistry department from 1940 until his retirement in 1952. 
D. was famous both as a theoretical and as an experimental physicist. He formulated the 
$T^3$ law for the decrease of the specific heat of solids at low temperatures. He developed
the Debye-Scherrer method (1917 independently of A.W. Hull) and, jointly with
E. Hückel, formulated a theory of dissociation and conductivity of strong electrolytes.
Independently of W.F. Giauque and almost at the same time D. pointed out the
possibility of reaching low temperatures by adiabatic demagnetization of paramagnetic
substances. During extensive research, he determined the dipole moments of molecules. 
This research together with results of the diffraction experiments of X-rays and electron 
rays from gases and liquids enabled him to establish their molecular structure; for this he 
was awarded the Nobel Prize in Chemistry in 1936. After his retirement, he developed 
methods to determine the molecular weight and the molecular expansion of giant 
molecules of highly polymerized substances. [BR] 


RITZ, Walter, Swiss physicist, *Sitten 22.2.1878, †Göttingen 7.7.1909, formulated the
combination principle for spectral lines in 1908. 


FRANCK, James, German physicist, *Hamburg 20.8.1882, †Göttingen (during a journey
through Germany) 21.5.1964. Franck was a member of the Kaiser-Wilhelm-Institut
für Physikalische Chemie, and, beginning in 1920, a professor in Göttingen; he left
Germany in 1933. From 1935, F. was a professor of physics at Johns Hopkins University
in Baltimore; 1938-1947, professor of physical chemistry in Chicago; from 1941 on, 
he was also active at the University of California. Jointly with G. Hertz, at the 
Physikalisches Institut in Berlin, F. investigated the energy transfer of electrons colliding 
with gas atoms. His results sustained Planck’s quantum hypothesis as well as the theory 
of spectral lines postulated by Bohr in 1913. For this work F. and Hertz were awarded the 
Nobel Prize in Physics in 1926. Extending these investigations, F. measured for the first 
time the dissociation energy of chemical compounds by optical means and determined 
the lifetime of metastable states of atoms. In addition he developed the law for the 
intensity distribution within a band structure, which is known today as the Franck- 
Condon principle. In the U.S.A. he devoted himself primarily to the investigation of 
photochemical processes within the living plant cell. During the Second World War F. 
worked on a project involving the technical utilization of nuclear energy. In 1945 he 
warned of the political and economic consequences of the use of atomic bombs in 
a petition which has become well known as the Franck Report. [BR] 


HERTZ, Gustav, German physicist, nephew of Heinrich Hertz, *Hamburg 22.7.1887, 
first was a professor in Halle and Berlin and head of the research laboratory of the 
Siemens factories. From 1945 to 1954, H. built up an institute at Suchumi on the Black Sea 
together with former students and collaborators; in 1954 he directed a university institute 
in Leipzig. From 1911 on, together with J. Franck, he investigated the excitation of atoms 
by collisions with electrons; they shared the Nobel Prize in Physics in 1926. In 1932, 
H. developed the technique of isotope separation with a diffusion cascade consisting of 
many single steps. He applied this method to the extraction of uranium 235 on a large 
technical scale in the Soviet Union. [BR] 


STERN, Otto, German physicist, *Sorau (Niederlausitz) 17.2.1888, †Berkeley (CA) 
17.8.1969, became a professor in Rostock in 1921, and in Hamburg in 1923, where he also 
acted as director of the Physikalisch-Chemisches Institut. Beginning in 1915, S. developed 
the method of using molecular rays for the determination of atomic and nuclear 
properties. He had particular success in discovering the quantization of the magnetic 
moment (the Stern-Gerlach experiment), in his diffraction experiments with molecular 
hydrogen and helium rays (1929), and in determining the magnetic moment of the proton 
(begun in 1933). S. emigrated to the United States in 1933 and worked at the Carnegie 
Institute of Technology in Pittsburgh. In 1943, he was awarded the Nobel Prize in 
Physics. [BR] 


GERLACH, Walther, German physicist, *Biebrich a. Rh. 1.8.1889, †Munich 1979, 
professor in Frankfurt, Tübingen and from 1929 in Munich, determined the value of the 
Stefan-Boltzmann constant by precision measurements (1916). Together with Otto Stern 
he showed the quantization of the magnetic moment by deflection of atomic rays in an 
inhomogeneous magnetic field (1921). At that time G. was a lecturer at the Physikalisches 
Institut der Universität Frankfurt a.M.; Otto Stern was visiting lecturer at the Institut 
für Theoretische Physik in Frankfurt, which was directed by Max Born (as successor to 
Max v. Laue) at that time. Furthermore, G. worked on quantitative spectral analysis and 
the coherence between atomic structure and magnetism. In extensive historical analyses 
of science, G. tried to point out the “humanistic value of physics”. [BR] 


2. The Radiation Laws 


The energy density $\varrho(\omega, T)$ of the radiation emitted by a black body was 
described by two separate, contradictory theorems in classical physics. The 
Rayleigh-Jeans radiation law accounted for experiments in the region of long-wave 
radiation; Wien’s law for those in the region of short-wave radiation. By 
introducing a new constant $h$, Planck was successful in finding an interpolation 
between the two laws. 

Planck’s radiation law covered the whole range of frequencies and contained 
the two other radiation laws as specific extreme cases (Fig. 2.1). In the beginning, 
Planck’s law was only an interpolation formula, but later he was able to show 
that this radiation law could be deduced under the assumption that the energy 
exchange between radiation and black body was discontinuous. The quanta of 
energy transfer are given by the relation $E = h\nu$, $\nu$ being the frequency. From the 
historical point of view, this was the beginning of quantum mechanics. 

In the following we will explicitly discuss the various radiation laws. 


2.1 A Preview of the Radiation of Bodies 


If radiation hits a body, it can enter its interior or be reflected by its surface. The 
reflection is regular if the angle between the incident ray and the normal to the 
surface is the same as between reflected ray and normal and if all of these 
together, i.e. incident, reflected ray and normal, are situated in the same plane. 
On the other hand, if the rays are also reflected in other directions, we then 
speak of diffuse reflection. If the reflected part is the same in all directions, 
independent of the direction of incidence and of the colour of the light, then one 
calls the reflecting surface gray. If all the incident light is reflected without any 
loss in this way, the surface is white. 
A white surface element $dF$ reflects (emits) the radiant flux 

\[ J(\omega, T) \cos\theta \, dF \, d\Omega \, d\omega \]

into the solid angle $d\Omega$ at an angle $\theta$ relative to the normal of $dF$. Its radiance 
$J(\omega, T)$ is the same for all directions. The radiant flux is proportional to the 
cosine of the angle $\theta$ between the direction of reflection and the normal of the 
surface (Lambert’s cosine law). If a gray or white surface of arbitrary shape is 
illuminated from an arbitrary direction, it will reflect with the same apparent 
radiance in all directions. The quantity of light reflected by every surface element 
$dF$ is proportional to the projection $dF \cos\theta$ of $dF$ onto a plane perpendicular to 
the direction of reflection (Fig. 2.2). Therefore a white or gray surface seems to 
have the same brightness, seen from any direction, but a different size. 

Fig. 2.1. Qualitative behaviour of the emittance according to the laws of Rayleigh-Jeans and Wien: the spectrum of a black body as a function of $\hbar\omega/kT = hc/kT\lambda$ 

Fig. 2.2. Radiation emitted by surface $dF$ at angle $\theta$ to the normal of the surface, appearing to arise from the projected surface $dF \cos\theta$ 

The radiation which is not reflected at the surface of a body penetrates it, 
either passing through it or being absorbed by it. A body that absorbs all 
radiation that hits it without letting any part through or reflecting it, is called 
black. 


2.2 What is Cavity Radiation? 


Now we consider the radiation field that exists inside a cavity with walls formed 
by absolutely black bodies at temperature $T$. If the black walls emitted no 
radiation, then none could exist inside the cavity since it would very quickly be 
absorbed. However, it is an experimental fact that black bodies do emit light at 
high temperatures. Without any exact knowledge concerning this emission by 
black bodies, we can nevertheless draw various conclusions from its existence: 

1) After a short period of time, the radiation inside the cavity will reach 
a thermal equilibrium caused by the emission and absorption by the walls. If this 
equilibrium is reached, the radiation field will no longer vary. 

2) Everywhere in the cavity, the radiance $J(\omega, T)$ is independent of the 
direction of the light rays. The radiation field is isotropic and independent of the 
shape of the cavity or of the material of its walls. If this were not true, we could 
place a black body in the form of a small disc having the same temperature as 
the walls into the cavity, and it would heat up if the plane of the disc were 
perpendicular to the direction in which $J(\omega, T)$ is largest. This, however, would 
contradict the 2nd law of thermodynamics. 

3) The radiation field has the same properties at each point of the cavity: 
$J(\omega, T)$ is independent of spatial coordinates. If this were not the case, little 
carbon sticks that have the temperature of the wall could be set at two different 
points; a stick would absorb more of the radiation at a point where the 
radiation field is stronger than at a point where it is weaker. As a result, the two 
sticks in the cavity radiation would reach different temperatures. Again, this is 
not possible according to the 2nd law of thermodynamics. 

4) The cavity radiation hits all surface elements of the wall with radiance 
$J(\omega, T)$. The surface has to emit as much radiation as it absorbs, therefore the 
radiance of a black body is $J(\omega, T)$. Thus the emission of all black bodies of 
the same temperature is identical and depends only on the temperature. Their 
radiance is independent of direction. The unpolarized light flux 

\[ 2 J(\omega, T) \cos\theta \, d\omega \, dF \, d\Omega \]

is sent into a cone of aperture $d\Omega$, whose axis is inclined to the normal vector of 
the black surface element $dF$ by an angle $\theta$ (Fig. 2.3). The factor 2 is caused by 
the two possible polarizations of each ray. $J(\omega, T)$ depends only on temperature 
and frequency. For emission by black bodies, Lambert’s cosine theorem is valid, 
as it is for reflection by white surfaces. A glowing black body appears bright all 
over, seen from any direction. 

5) In a cavity enclosed by walls impenetrable to radiation, the same radiation 
exists as in a cavity with black walls. If we put a little carbon stick inside, it 
has to be in thermal equilibrium with the walls and the radiation. This will only 
be the case if the radiation is the same as for black walls. The radiation inside 
a cavity enclosed by impervious or black walls is called black body radiation. 

6) A single electromagnetic wave causes an energy density $\varrho$ which is related 
to the current $|\mathbf{S}|$ by 

\[ \varrho = \frac{|\mathbf{S}|}{c} . \tag{2.1} \]

Linearly polarized radiation with a frequency between $\omega$ and $\omega + d\omega$ and 
a direction of propagation within a certain solid angle between $\Omega$ and $\Omega + d\Omega$ makes the 
following contribution to the energy density: 

\[ \frac{J(\omega, T) \, d\omega \, d\Omega}{c} , \tag{2.2} \]

and twice as much, if we take into account the two possible directions of 
polarization. By integrating over $d\Omega$, we get for the energy density of the total 
radiation 

\[ \varrho(\omega, T) \, d\omega = \frac{8\pi J(\omega, T) \, d\omega}{c} \tag{2.3} \]

in the interval $d\omega$, since, for cavity radiation, $J(\omega, T)$ is independent of the 
direction $\Omega$. Thus we obtain a general relation between the radiance $J(\omega, T)$ and 
the energy density $\varrho(\omega, T)$ of the cavity radiation, namely, 

\[ J(\omega, T) = \frac{c \, \varrho(\omega, T)}{8\pi} . \tag{2.4} \]


The following exercise will help in further understanding this relation. 


Fig. 2.3. Cone of aperture $d\Omega$, whose axis is inclined to the normal vector of the surface element $dF$ by an angle $\theta$ 


EXERCISE 


2.1 On Cavity Radiation 


Problem. Clarify once more the relation between the radiance (intensity of 
emitted energy per unit solid angle) and the energy density $\varrho(\omega, T)$ of cavity 
radiation. 


Solution. Concerning energy and intensity: 
For a plane wave we have 

\[ t = \frac{l}{c} , \quad E = \varrho_0 V = \varrho_0 l A , \]

\[ J_0 = \frac{E}{tA} = \frac{P}{A} = \frac{\varrho_0 l A \, c}{l A} = c \varrho_0 , \quad \text{where} \tag{1} \]

$\varrho_0$ = energy density of radiation for a single plane wave, 

$J_0$ = intensity = radiant power per unit area for a single plane wave, 
$E = \varrho_0 V$ = radiant energy of the volume $V$ for a single plane wave, 
$V = lA$ = volume, 

$P = E/t$ = radiant power. 


Let a cavity contain an isotropic electromagnetic radiation field. We would 
like to know the radiant power per area, i.e. the intensity of the radiation that 
emerges from the aperture of area A. 


A =area of emergence 


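We construct the isotropic radiation by $N$ plane waves with $\mathbf{k}$-vectors 
pointing equally in all directions of the space. Then we have for any wave: 

\[ J_0 = c \varrho_0 . \]

$n_i$ is the number of those $\mathbf{k}$ that point into the solid angle $d\Omega_i$ for which 
$\theta_i \le \theta \le \theta_i + d\theta_i$ is valid. 

Now we have 

\[ \frac{n_i}{N} = \frac{d\Omega_i}{4\pi} = \frac{2\pi \sin\theta_i \, d\theta_i}{4\pi} = \frac{1}{2} \sin\theta_i \, d\theta_i . \tag{2} \]

Because of the isotropy we have integrated over $\varphi$. Through $A$ flows 

\[ P_{0i} = 2 J_0 A \cos\theta_i \tag{3} \]

($P$ = radiant power) per plane wave. The factor 2 appears because there are two 
degrees of freedom of polarization. Hence 

\[ P_i = n_i P_{0i} = \frac{1}{2} N \sin\theta_i \, d\theta_i \, 2 J_0 A \cos\theta_i , \tag{4} \]

\[ P_{\text{tot}} = \sum_i P_i = A N J_0 \sum_i \sin\theta_i \cos\theta_i \, d\theta_i ; \tag{5} \]

$\theta_i$ runs from 0 to $\pi/2$. The sum can be replaced by an integral: 

\[ J_{\text{tot}} = \frac{P_{\text{tot}}}{A} = N J_0 \int_0^{\pi/2} \sin\theta \cos\theta \, d\theta = \frac{1}{2} N J_0 \int_0^{\pi/2} \sin 2\theta \, d\theta = \frac{1}{2} N J_0 \left[ -\frac{1}{2} \cos 2\theta \right]_0^{\pi/2} = \frac{1}{2} N J_0 \left( \frac{1}{2} + \frac{1}{2} \right) = \frac{1}{2} N J_0 . \tag{6} \]

For the total energy density $\varrho$, we have 

\[ \varrho = \sum_N 2 \varrho_0 = 2 N \varrho_0 . \]

Again, the factor 2 is due to the two possible polarizations per plane wave. 
Together with $J_0 = c \varrho_0$ we get 

\[ J_{\text{tot}} = \frac{1}{2} N c \varrho_0 = \frac{1}{2} N c \frac{\varrho}{2N} = \frac{c}{4} \varrho . \]

Therefore the total intensity is 

\[ J_{\text{tot}} = \frac{c}{4} \varrho . \tag{7} \]

It is emitted into a half-space with the solid angle $\Omega_H = 2\pi$. The intensity per 
unit solid angle (the radiance) is therefore 

\[ J = \frac{J_{\text{tot}}}{\Omega_H} = \frac{c}{8\pi} \varrho . \]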
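The steps of this solution can be checked numerically. The following Python sketch is only an illustration: the function names, the CGS value of c and the sample energy density are assumptions, not taken from the text. It evaluates the angular integral of (6) with a midpoint rule and compares the resulting total intensity with $c\varrho/4$ and the radiance with $c\varrho/8\pi$.

```python
import math

C_LIGHT = 2.998e10  # speed of light in cm/s (assumed CGS value)


def angular_integral(steps: int = 100_000) -> float:
    """Midpoint-rule estimate of the integral of sin(t)*cos(t) from 0 to pi/2."""
    h = (math.pi / 2) / steps
    return h * sum(
        math.sin((i + 0.5) * h) * math.cos((i + 0.5) * h) for i in range(steps)
    )


def total_intensity(rho: float, n_waves: int = 1000) -> float:
    """J_tot = N * J0 * integral, with rho = 2*N*rho0 and J0 = c*rho0."""
    rho0 = rho / (2 * n_waves)  # energy density of one polarized plane wave
    j0 = C_LIGHT * rho0         # intensity of one plane wave
    return n_waves * j0 * angular_integral()


rho = 1.0                         # illustrative energy density in erg/cm^3
j_tot = total_intensity(rho)      # compare with c*rho/4
radiance = j_tot / (2 * math.pi)  # compare with c*rho/(8*pi)
print(j_tot / (C_LIGHT * rho), radiance / (C_LIGHT * rho))
```

The number of plane waves N drops out of the result, as it does in the analytic calculation.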


2.3 The Rayleigh-Jeans Radiation Law — 
The Electromagnetic Eigenmodes of a Cavity 


First of all we calculate the radiation density of a radiation field in thermodynamic 
equilibrium. The average energy per degree of freedom is then given by 
$\frac{1}{2} k_B T$. To determine the number of degrees of freedom of the radiation field given 
by the vector potential $\mathbf{A}$, we consider a cubical volume of edge length $a$. We 
now assume that the volume contains neither charges nor currents and has 
perfectly reflecting (mirrored) inner surfaces (hence the often-used name, cavity 
radiation). The vector potential obeys d’Alembert’s equation: 

\[ \Delta \mathbf{A} - \frac{1}{c^2} \frac{\partial^2 \mathbf{A}}{\partial t^2} = 0 . \tag{2.5} \]

Fig. 2.4. Radiation field assumed to be enclosed in a box 


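By separating off the time dependence, i.e. $\mathbf{A}(\mathbf{r}, t) = \mathbf{A}(\mathbf{r}) \exp(i\omega t)$, we get for the 
space-dependent part of the function the Helmholtz equation: 

\[ \left( \Delta + \frac{\omega^2}{c^2} \right) \mathbf{A}(\mathbf{r}) = (\Delta + k^2) \mathbf{A}(\mathbf{r}) = 0 . \tag{2.6} \]

Here, 

\[ \mathbf{A}(\mathbf{r}, t) = \mathbf{A}(\mathbf{r}) \sin(\omega t) \quad \text{or} \quad \mathbf{A}(\mathbf{r}, t) = \mathbf{A}(\mathbf{r}) \cos(\omega t) \tag{2.7} \]

and $k = \omega/c$ is the wave number. We do not want to solve the two wave 
equations explicitly, but use the boundary conditions of the problem to determine 
the number of degrees of freedom. 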

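The vector potential is free of sources (Coulomb gauge); therefore 

\[ \operatorname{div} \mathbf{A} = 0 . \]

This condition is equivalent to the transversality of the plane waves in the box. 
For every wave vector $\mathbf{k}$, there exist two independent amplitudes $\mathbf{A}$ (polarizations) 
of the vector potential, each of them of the form: 

\[ \mathbf{A}(\mathbf{r}) = \mathbf{A} \sin(\mathbf{k} \cdot \mathbf{r}) \quad \text{or} \quad \mathbf{A}(\mathbf{r}) = \mathbf{A} \cos(\mathbf{k} \cdot \mathbf{r}) \quad \text{with} \tag{2.8} \]

\[ \mathbf{A} \cdot \mathbf{k} = 0 \quad \text{or} \quad A_x k_x + A_y k_y + A_z k_z = 0 , \quad \text{and} \tag{2.9} \]

\[ k^2 = k_x^2 + k_y^2 + k_z^2 = \frac{\omega^2}{c^2} . \tag{2.10} \]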


For the components $k_x$, $k_y$, $k_z$ of the wave vector, we deduce conditions by 
demanding that the tangential components of $\mathbf{A}$ vanish on the reflecting inner 
surfaces (mirrors) of the cube. These conditions exclude the second solution (2.8) 
and yield for the first type 

\[ \sin(k_x a) = \sin(k_y a) = \sin(k_z a) = 0 , \]

from which the wave numbers 

\[ k_x = \frac{n_x \pi}{a} , \quad k_y = \frac{n_y \pi}{a} , \quad k_z = \frac{n_z \pi}{a} \tag{2.11} \]

follow. The numbers $n_x$, $n_y$, $n_z$ are restricted to positive integer values only, 
because we are looking for stationary waves in the volume. The number of 
linearly independent functions $\mathbf{A}(\mathbf{r}, t)$ in the interval $(dn_x, dn_y, dn_z)$ of number space is 
the volume element in number space itself, i.e. 

\[ dn_x \, dn_y \, dn_z . \tag{2.12} \]


Together with $n^2 = n_x^2 + n_y^2 + n_z^2$, we adopt spherical coordinates and get for 
the number of lattice points in the first octant of the spherical shell (see Fig. 2.5) 

\[ \frac{1}{8} 4\pi n^2 \, dn = \frac{1}{2} \pi n^2 \, dn . \tag{2.13} \]

With (2.10) and (2.11) we get the number $dN'(\omega)$ of independent solutions for 
the $\mathbf{A}$ field, situated in the frequency interval between $\omega$ and $\omega + d\omega$ such that 

\[ \frac{1}{2} \pi n^2 \, dn = \frac{1}{2} \pi \left( \frac{a}{\pi c} \right)^3 \omega^2 \, d\omega = \frac{V}{2\pi^2 c^3} \omega^2 \, d\omega = dN'(\omega) , \tag{2.14} \]

where $V = a^3$ is the volume of the cube. Now, if we also take into account the 
two directions of polarization for every electromagnetic normal mode, we finally 
get the density $dN/d\omega$ of the possible electromagnetic states in a cavity, 

\[ \frac{dN(\omega)}{d\omega} = \frac{V \omega^2}{\pi^2 c^3} . \tag{2.15} \]


According to statistical thermodynamics, the average kinetic energy per degree 
of freedom is given by $\frac{1}{2} k_B T$.¹ The energy per volume and frequency interval $d\omega$ 
is given by taking into account the two possible directions of polarization: 

\[ \frac{dE}{V \, d\omega} = \frac{1}{V} \frac{dN}{d\omega} k_B T = \frac{\omega^2}{\pi^2 c^3} k_B T . \tag{2.16} \]

The spectral energy density is therefore 

\[ \varrho(\omega, T) = \frac{k_B T}{\pi^2 c^3} \omega^2 . \tag{2.17} \]

Using the relation between the emittance and the energy density of the radiation 
already deduced in (2.4), we can immediately write down the emittance: 

\[ J(\omega, T) \, d\omega = \frac{c}{8\pi} \varrho(\omega, T) \, d\omega = \frac{k_B T}{8\pi^3 c^2} \omega^2 \, d\omega . \tag{2.18} \]

This is the Rayleigh-Jeans radiation law. 
The equations deduced here coincide with experiment only for low frequencies 
$\omega$. We can already see from (2.15) and (2.17) that they cannot be valid for 
high frequencies, since the energy density becomes infinite as $\omega \to \infty$. 
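The counting argument behind (2.13) to (2.15) can be illustrated by brute force. The Python sketch below is a hedged illustration (the function names are hypothetical): it counts the positive-integer triples $(n_x, n_y, n_z)$ inside a sphere of radius $n_{\max}$, doubles the count for the two polarizations, and compares it with the integral of (2.15) up to $\omega_{\max} = n_{\max} \pi c / a$, which equals $(\pi/3) n_{\max}^3$.

```python
import math


def count_modes(n_max: int) -> int:
    """Count standing-wave modes with n_x^2 + n_y^2 + n_z^2 <= n_max^2.

    Only positive integers are allowed (first octant); the factor 2
    accounts for the two polarizations.
    """
    r2 = n_max * n_max
    points = 0
    for nx in range(1, n_max + 1):
        for ny in range(1, n_max + 1):
            rest = r2 - nx * nx - ny * ny
            if rest < 1:
                break  # a larger ny only makes `rest` smaller
            points += math.isqrt(rest)  # number of n_z >= 1 with n_z^2 <= rest
    return 2 * points


def modes_from_density(n_max: int) -> float:
    """Integral of (2.15) from 0 to omega_max, i.e. (pi/3) * n_max^3."""
    return math.pi * n_max**3 / 3


for n_max in (20, 50, 100):
    print(n_max, count_modes(n_max) / modes_from_density(n_max))
```

The ratio moves towards 1 as $n_{\max}$ grows, which is the regime in which replacing the lattice sum by the continuous shell volume is justified.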


2.4 Planck’s Radiation Law 


In contrast with the classical derivation of the radiation density according to 
Rayleigh-Jeans, we now want to determine the density of photons. Photons are 
emitted or absorbed by the transition of an atom from one energy state to 
another. The main problem here is the quantum mechanical calculation of the 
transition probabilities. However, it is possible to determine the proportions of 
the transition probabilities and to ascertain the energy density of the photon 
field by comparison with the Rayleigh-Jeans law, without explicit calculation of 
the transition rates. This derivation of Planck’s radiation law originates from 
Albert Einstein. 

¹ The average kinetic energy for an oscillator is also $\frac{1}{2} k_B T$, but its mean potential energy 
is of the same magnitude (virial theorem). Therefore the average energy per frequency 
is $k_B T$. 

Fig. 2.5. Two-dimensional illustration of the counting method. Every eigenfrequency is represented by a point with the coordinates $n_x \pi/a$, $n_y \pi/a$. The distance between the origin and this point is the wave number of the eigenmode. The two circles have radii $k$ and $k + dk$, respectively 

Fig. 2.6. Photon of frequency $\omega$ being emitted in the transition from a state of energy $E_m$ to a state of energy $E_n$ 

Two of the energy eigenstates (energy levels) of an atom are represented in 
Fig. 2.6. The atom jumps spontaneously from the energy level $E_m$ to the state of 
lower energy $E_n$ and emits a photon of frequency $\omega = (E_m - E_n)/\hbar$. 

The photon has two different polarizations as a transverse electromagnetic 
wave. We identify them by the index $\alpha$ ($\alpha = 1, 2$). The probability for the 
spontaneous transition is denoted by $a_{m\alpha}^n$. The probability for the emission of 
a photon into a solid-angle element $d\Omega$ is 

\[ dW_e' = a_{m\alpha}^n \, d\Omega . \tag{2.19} \]

The presence of photons in the radiation field in the vicinity of an atom in the 
state $E_m$ stimulates it to an induced emission with the probability $dW_e''$. This has 
to be added to the spontaneous emission probability. The total probability is 
therefore 

\[ dW_e = dW_e' + dW_e'' . \]

The probability that an atom in the state $E_n$ is able to absorb a photon of energy 
$\hbar\omega$ and move to a state $E_m$ is denoted by $dW_a$. 

Einstein, who first made these considerations, set the probability of absorption 
and induced emission proportional to the number of photons contained in 
the radiation field. The energy per element of solid angle and interval of frequency 
for photons of polarization $\alpha$ is again denoted by the spectral energy density 
$\varrho_\alpha(\omega, T, \Omega)$. 

The number of photons in the frequency range between $\omega$ and $\omega + d\omega$ and 
in the solid-angle element $d\Omega$ is given by 

\[ \frac{\varrho_\alpha(\omega, T, \Omega) \, d\omega \, d\Omega}{\hbar\omega} . \]

For the probabilities of absorption and induced emission, we write analogously 
to (2.19) 

\[ dW_e'' = b_{m\alpha}^n \, \varrho_\alpha(\omega, T, \Omega) \, d\Omega , \quad dW_a = b_{n\alpha}^m \, \varrho_\alpha(\omega, T, \Omega) \, d\Omega . \tag{2.20} \]

The coefficients $b$ are the transition probabilities per unit spectral energy 
density. Therefore they have a different dimension from the transition probability 
$a_{m\alpha}^n$. 

Now we denote by $N_n$ ($N_m$) the number of atoms in the state of energy 
$E_n$ ($E_m$). Then radiative equilibrium is characterized by 

\[ N_m (dW_e' + dW_e'') = N_n \, dW_a , \tag{2.21} \]

that is, the number of emission and absorption processes is equal. 
We insert (2.19) and (2.20) and get 

\[ N_m \left( b_{m\alpha}^n \, \varrho_\alpha(\omega, T, \Omega) + a_{m\alpha}^n \right) = N_n \, b_{n\alpha}^m \, \varrho_\alpha(\omega, T, \Omega) . \tag{2.22} \]

The coefficients $a_{m\alpha}^n$, $b_{m\alpha}^n$, $b_{n\alpha}^m$ characterize properties of the atom and are related 
to each other. It is permissible, however, to choose special conditions (like 
radiative equilibrium) to determine the relations between the coefficients, since 
the latter are not changed by doing so. 


The number of atoms in the state $E_n$ is given by the Boltzmann distribution, 
i.e. we have the relation 

\[ N_n : N_m = \exp(-E_n/k_B T) : \exp(-E_m/k_B T) . \]

Here, as before in Sect. 2.3, we have added the index B that distinguishes the 
Boltzmann constant $k_B$ from the wave number, so that (2.22) becomes 

\[ \exp(-E_m/k_B T) \left( b_{m\alpha}^n \, \varrho_\alpha(\omega, T, \Omega) + a_{m\alpha}^n \right) = \exp(-E_n/k_B T) \, b_{n\alpha}^m \, \varrho_\alpha(\omega, T, \Omega) . \tag{2.23} \]

For very high temperatures $T \to \infty$, the exponential function approaches 1 and 
the spectral energy density also becomes very large. Then we can neglect the 
term $a_{m\alpha}^n$ and obtain the relation 

\[ b_{m\alpha}^n = b_{n\alpha}^m . \]

We rearrange (2.23) and get for the spectral energy density, by making use of 
$E_m - E_n = \hbar\omega$, 

\[ \varrho_\alpha(\omega, T, \Omega) = \frac{a_{m\alpha}^n}{b_{m\alpha}^n} \frac{1}{\exp(\hbar\omega/k_B T) - 1} . \tag{2.24} \]

A comparison with the Rayleigh-Jeans law for the spectral energy density, which 
is valid for low frequencies, gives the proportion of the transition coefficients. By 
expanding the denominator for $\hbar\omega/k_B T \ll 1$, we find 

\[ \varrho_\alpha(\omega, T, \Omega) \approx \frac{a_{m\alpha}^n}{b_{m\alpha}^n} \frac{k_B T}{\hbar\omega} . \]

Comparing now (2.24) and (2.17), we have to pay attention to the fact that 
$\varrho_\alpha(\omega, T, \Omega)$ is the spectral energy density per polarization and solid angle: 

\[ \varrho_\alpha(\omega, T, \Omega) = \frac{1}{8\pi} \varrho(\omega, T) . \]

Hence we obtain 

\[ \frac{a_{m\alpha}^n}{b_{m\alpha}^n} = \frac{1}{8\pi} \frac{\hbar\omega^3}{\pi^2 c^3} = \frac{\hbar\omega^3}{8\pi^3 c^3} \]

and therefore 

\[ \varrho_\alpha(\omega, T, \Omega) = \frac{\hbar\omega^3}{8\pi^3 c^3} \frac{1}{\exp(\hbar\omega/k_B T) - 1} . \tag{2.25} \]

By integrating over the solid angle and adding the two directions of polarization, 
it follows that 

\[ \varrho(\omega, T) = \frac{\hbar\omega^3}{\pi^2 c^3} \frac{1}{\exp(\hbar\omega/k_B T) - 1} . \tag{2.26} \]


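Corresponding to the derivation of (2.18), we get for the radiance: 

\[ J(\omega, T) \, d\omega = \frac{\hbar\omega^3}{8\pi^3 c^2} \frac{1}{\exp(\hbar\omega/k_B T) - 1} \, d\omega . \tag{2.27} \]

For high frequencies $\hbar\omega \gg k_B T$, Planck’s radiation formula turns into Wien’s 
radiation law. Indeed, (2.26) yields 

\[ \varrho(\omega, T) = \frac{\hbar\omega^3}{\pi^2 c^3} \exp(-\hbar\omega/k_B T) \tag{2.28} \]

for $\hbar\omega \gg k_B T$. 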
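The rate balance that leads from (2.21) to Planck’s law can be replayed numerically. The Python sketch below is an illustration under stated assumptions: the CGS constants, the function names and the sample frequency and temperature are not from the text, and the absolute size of the coefficient b is arbitrary because only the ratio a/b enters. It solves the equilibrium condition for the energy density per polarization and solid angle and compares $8\pi$ times that value with the Planck form (2.26).

```python
import math

HBAR = 1.0546e-27    # erg s (assumed CGS value)
K_B = 1.3807e-16     # erg/K (assumed CGS value)
C_LIGHT = 2.998e10   # cm/s (assumed CGS value)


def a_over_b(omega: float) -> float:
    """Ratio of spontaneous to induced coefficient, hbar*omega^3 / (8 pi^3 c^3)."""
    return HBAR * omega**3 / (8 * math.pi**3 * C_LIGHT**3)


def equilibrium_density(omega: float, temp: float, b: float = 1.0) -> float:
    """Solve N_m (b*rho + a) = N_n * b * rho for rho, using equal b coefficients."""
    a = a_over_b(omega) * b
    n_ratio = math.exp(HBAR * omega / (K_B * temp))  # N_n / N_m from Boltzmann
    return a / (b * (n_ratio - 1.0))


def planck_density(omega: float, temp: float) -> float:
    """Planck spectral energy density summed over directions and polarizations."""
    return HBAR * omega**3 / (math.pi**2 * C_LIGHT**3) / math.expm1(
        HBAR * omega / (K_B * temp)
    )


omega, temp = 3.0e14, 2000.0   # illustrative values
rho_alpha = equilibrium_density(omega, temp)
print(8 * math.pi * rho_alpha / planck_density(omega, temp))
```

Changing the argument `b` leaves the result unchanged, which mirrors the remark that the relation between the coefficients does not depend on the special conditions chosen to derive it.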

Let us consider the number density of the photons in the two limits of the 
Rayleigh-Jeans and the Wien radiation laws. With the frequencies $\omega_1$ and $\omega_2$, 
where $\omega_2 \gg \omega_1$, we obtain from (2.17) 

\[ dN_{RJ} = \frac{\varrho_{RJ}(\omega_1, T)}{\hbar\omega_1} \, d\omega = \frac{\omega_1 k_B T}{\pi^2 c^3 \hbar} \, d\omega \tag{2.29} \]

and from (2.28) 

\[ dN_W = \frac{\varrho_W(\omega_2, T)}{\hbar\omega_2} \, d\omega = \frac{\omega_2^2}{\pi^2 c^3} \exp(-\hbar\omega_2/k_B T) \, d\omega . \tag{2.30} \]

The relation between the photon number densities follows as 

\[ \frac{dN_W}{dN_{RJ}} = \exp(-\hbar\omega_2/k_B T) \, \frac{\hbar\omega_2^2}{k_B T \omega_1} \ll 1 . \tag{2.31} \]

Since $\hbar\omega_2/k_B T \gg 1$, the exponential function makes the ratio of the number 
densities small. We can conclude from this that the wave character of light 
always becomes evident when many photons of low energy are present. The 
particulate character of light becomes noticeable in the case of photons of high 
energy and low number density. 
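The two limits of Planck’s law and the photon-number ratio can be made concrete with a few lines of Python. This is a minimal sketch, assuming CGS constants and an arbitrary temperature of 2000 K that are not taken from the text; the function names are hypothetical. It compares the Planck density with the Rayleigh-Jeans form at $\hbar\omega/k_B T = 0.01$ and with the Wien form at $\hbar\omega/k_B T = 20$, and evaluates the ratio (2.31) for these two frequencies.

```python
import math

HBAR = 1.0546e-27    # erg s (assumed CGS value)
K_B = 1.3807e-16     # erg/K (assumed CGS value)
C_LIGHT = 2.998e10   # cm/s (assumed CGS value)


def planck(omega: float, temp: float) -> float:
    """Planck spectral energy density."""
    x = HBAR * omega / (K_B * temp)
    return HBAR * omega**3 / (math.pi**2 * C_LIGHT**3) / math.expm1(x)


def rayleigh_jeans(omega: float, temp: float) -> float:
    """Rayleigh-Jeans spectral energy density, valid for low frequencies."""
    return K_B * temp * omega**2 / (math.pi**2 * C_LIGHT**3)


def wien(omega: float, temp: float) -> float:
    """Wien spectral energy density, valid for high frequencies."""
    x = HBAR * omega / (K_B * temp)
    return HBAR * omega**3 / (math.pi**2 * C_LIGHT**3) * math.exp(-x)


def photon_number_ratio(omega1: float, omega2: float, temp: float) -> float:
    """dN_W / dN_RJ for equal frequency intervals, as in (2.31)."""
    x2 = HBAR * omega2 / (K_B * temp)
    return math.exp(-x2) * HBAR * omega2**2 / (K_B * temp * omega1)


temp = 2000.0                            # illustrative temperature in K
omega_low = 0.01 * K_B * temp / HBAR     # hbar*omega / k_B T = 0.01
omega_high = 20.0 * K_B * temp / HBAR    # hbar*omega / k_B T = 20
ratio_low = planck(omega_low, temp) / rayleigh_jeans(omega_low, temp)
ratio_high = planck(omega_high, temp) / wien(omega_high, temp)
ratio_photons = photon_number_ratio(omega_low, omega_high, temp)
print(ratio_low, ratio_high, ratio_photons)
```

Both density ratios come out close to 1, while the photon-number ratio is far below 1 even though $\omega_2/\omega_1 = 2000$ here.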


EXAMPLE 


2.2 The Derivation of Planck’s Radiation Law According to Planck 


It is not only of historical but also of physical interest to see on which arguments 
Planck based his radiation law. This derivation is different from Einstein’s (done 
some years later) about which we have just learned. 

The radiation in a cavity whose walls are kept at a temperature $T$ is 
homogeneous, isotropic and unpolarized. Its energy density $E/V$ and frequency 
distribution of the energy density 

\[ E/V = \int \varrho(\omega, T) \, d\omega \tag{1} \]

are determined by the temperature alone. This is clear, but can also be verified in 
detail (Kirchhoff’s theorem); we will not do so here. 


The energy of the radiation in a cavity can be interpreted as the energy of 
the electromagnetic field; the frequencies that occur are the eigenoscillations 
(resonances) of this field. Let $V$ be the volume of the cavity and 

\[ \frac{dN}{d\omega}(\omega) \, d\omega \tag{2} \]

be the number of eigenoscillations of the field in the frequency range $d\omega$; 
furthermore, let $\bar{\varepsilon}(\omega, T)$ be the mean energy of an eigenoscillation of frequency 
$\omega$ at temperature $T$. Then the energy content of the cavity in the frequency 
interval $d\omega$ is 

\[ V \varrho(\omega, T) \, d\omega = \frac{dN}{d\omega}(\omega) \, \bar{\varepsilon}(\omega, T) \, d\omega ; \quad \text{thus} \]

\[ \varrho(\omega, T) = \frac{1}{V} \frac{dN}{d\omega}(\omega) \, \bar{\varepsilon}(\omega, T) . \tag{3} \]

The number of eigenoscillations is given [see (2.15)] by 

\[ \frac{dN(\omega)}{d\omega} = \frac{V \omega^2}{\pi^2 c^3} \tag{4} \]

and therefore 

\[ \varrho(\omega, T) = \frac{1}{\pi^2 c^3} \, \omega^2 \, \bar{\varepsilon}(\omega, T) . \tag{5} \]


Because the eigenoscillations of the electromagnetic field are harmonic, each 
eigenoscillation has two degrees of freedom, corresponding to the two polarization 
states of the radiation. Applying the theorem of equipartition of energy 
from statistical thermodynamics, the mean energy $\bar{\varepsilon}$ of each eigenoscillation is 

\[ \bar{\varepsilon} = k_B T . \tag{6} \]

Hence the Rayleigh-Jeans radiation formula follows: 

\[ \varrho(\omega, T) = \frac{k_B T}{\pi^2 c^3} \, \omega^2 . \tag{7} \]


For small values of $\hbar\omega/k_B T$, this is in agreement with experience, but it fails 
completely for higher frequencies. In particular, it does not show the characteristic 
decrease in energy density towards higher frequencies (see Fig. 2.1). That 
reality is at variance with the theoretical radiation formula (7) means that 
eigenoscillations with larger values of $\hbar\omega/k_B T$ contain less energy than is 
expected from the law of equipartition. 

Planck replaced this rule (6) with a completely different one: The energy of 
a harmonic oscillation is an integer multiple of an energy step proportional to 
the frequency: 

\[ \varepsilon_n = n \hbar\omega ; \quad n = 0, 1, 2, \ldots \tag{8} \]


Every energy state defined here is to be treated as distinct from every other one. 
Evaluating the mean energy in thermal equilibrium gives 

\[ \bar{\varepsilon} = \frac{\sum_n \varepsilon_n \exp(-\varepsilon_n/k_B T)}{\sum_n \exp(-\varepsilon_n/k_B T)} . \tag{9} \]

Introducing the partition sum 

\[ Z = \sum_n \exp(-\beta \varepsilon_n) , \quad \beta = \frac{1}{k_B T} \tag{10} \]

as an abbreviation, (9) becomes 

\[ \bar{\varepsilon} = -\frac{\partial}{\partial \beta} \ln Z . \tag{11} \]

From the evaluation of 

\[ Z = \sum_n [\exp(-\beta\hbar\omega)]^n = \frac{1}{1 - \exp(-\beta\hbar\omega)} \tag{12} \]

we get 

\[ \bar{\varepsilon} = \frac{\hbar\omega}{\exp(\hbar\omega/k_B T) - 1} \tag{13} \]

and with (5) we have Planck’s radiation law: 

\[ \varrho(\omega, T) = \frac{1}{\pi^2 c^3} \frac{\hbar\omega^3}{\exp(\hbar\omega/k_B T) - 1} . \tag{14} \]

This is (of course) identical to (2.26) given before. 

The figure below illustrates the dependence of the functions (13) and (14) on 
temperature (abscissa $k_B T/\hbar\omega$, ordinate $\bar{\varepsilon}/\hbar\omega$). Function (13) [or (14)] can be 
approximated for small $\hbar\omega/k_B T$ by a power series of the exponential. Then the 
mean energy becomes 

\[ \bar{\varepsilon} \approx k_B T . \]

Figure: Mean energy and specific heat of an oscillator 

For small $\hbar\omega/k_B T$ it makes no difference whether the energies of the 
eigenoscillations have the discrete values given by (8) or have continuously 
distributed values, as in classical theory. 

The approximation in the limit of large $\hbar\omega/k_B T$, 

\[ \bar{\varepsilon} \approx \hbar\omega \exp(-\hbar\omega/k_B T) , \]

leads to Wien’s radiation formula: 

\[ \varrho(\omega, T) = \frac{\hbar\omega^3}{\pi^2 c^3} \exp(-\hbar\omega/k_B T) . \]

The factor $[\exp(\hbar\omega/k_B T) - 1]^{-1}$ in the mean energy (13) can be interpreted as 
the number of photons in a state characterized by the photon energy $\hbar\omega$. This is 
important in Exercise 2.3. 

Planck’s hypothesis, that for harmonic oscillations only the energies 
$E = n\hbar\omega$ are legitimate, is in contradiction with the intuitive idea of an oscillation. 
His ingenious guess of the correct quantized states of the harmonic 
oscillator led to the breakthrough; that the introduction of the quantum of 
action $\hbar$ coincides in general with the non-classical description of microsystems 
became clear only in the course of later developments. 
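The step from the Boltzmann-weighted average (9) to the closed form (13) can be checked by summing the series directly. The Python sketch below is a hedged illustration (the function names and the truncation at 2000 terms are assumptions): energies are measured in units of $\hbar\omega$ and $x$ stands for $\hbar\omega/k_B T$.

```python
import math


def mean_energy_sum(x: float, n_max: int = 2000) -> float:
    """Mean energy in units of hbar*omega from the truncated sum over n."""
    weights = [math.exp(-n * x) for n in range(n_max + 1)]
    return sum(n * w for n, w in enumerate(weights)) / sum(weights)


def mean_energy_closed(x: float) -> float:
    """Closed form 1 / (exp(x) - 1), again in units of hbar*omega."""
    return 1.0 / math.expm1(x)


for x in (0.1, 1.0, 5.0):
    print(x, mean_energy_sum(x), mean_energy_closed(x))

# Classical limit: for small x the mean energy in units of k_B T tends to 1.
classical_limit = 0.01 * mean_energy_closed(0.01)
# Wien limit: for large x the mean energy tends to exp(-x) in units of hbar*omega.
wien_limit = mean_energy_closed(20.0) / math.exp(-20.0)
```

The truncation is harmless for the values of $x$ used here because the terms decay geometrically; for much smaller $x$ a larger `n_max` would be needed.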


EXERCISE 


2.3 Black Body Radiation 


From the foregoing, we know the density of states per frequency interval $dN(\omega)/d\omega$ of 
the electromagnetic field in a cavity resonator with volume $V$ (2.15), i.e. 

\[ \frac{dN(\omega)}{d\omega} = \frac{V \omega^2}{\pi^2 c^3} . \]

Here, the two directions of polarization are included. 


Problem. a) Consider the cavity as a container of photons and calculate the 
spectral distribution $\frac{1}{V} \frac{dE}{d\omega}(\omega)$ of the black body radiation that escapes from 
a tiny hole in the cavity. Consider that photons are spin-1 particles (bosons) and 
that the number of photons in a state of energy $E$ at temperature $T$ is given by 

\[ f_{BE} = (e^{E/k_B T} - 1)^{-1} \quad \text{(the Bose-Einstein distribution)} . \]

Compare the result with Planck’s law. 

b) Show that the total electromagnetic energy in the cavity with walls kept 
at temperature $T$ is proportional to $T^4$, and evaluate the factor of proportionality. 


Hint: 

\[ \int_0^\infty \frac{x^3 \, dx}{e^x - 1} = \int_0^\infty x^3 e^{-x} \sum_{n=0}^\infty e^{-nx} \, dx = \sum_{n=0}^\infty \frac{1}{(n+1)^4} \int_0^\infty y^3 e^{-y} \, dy = 6 \sum_{n=1}^\infty \frac{1}{n^4} = \frac{\pi^4}{15} . \]

Figure: Emittance in time interval $dt$ from a surface element $df$, with $d\Omega = \sin\theta \, d\theta \, d\varphi$ 


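Solution. a) In a state with energy $E$ at temperature $T$, we find 

\[ f_{BE} = [\exp(\hbar\omega/k_B T) - 1]^{-1} \tag{1} \]

photons with the energy $E = \hbar\omega$. The density of states is $dN/d\omega$; therefore, the 
energy per frequency interval in the resonator is given by 

\[ \frac{dE}{d\omega} = \frac{dN}{d\omega} \, \hbar\omega \, f_{BE} = \frac{dN}{d\omega} \frac{\hbar\omega}{\exp(\hbar\omega/k_B T) - 1} , \tag{2} \]

and the energy density for the volume $V$ and frequency interval $d\omega$ is 

\[ \frac{1}{V} \frac{dE(\omega)}{d\omega} = \frac{1}{V} \frac{dN(\omega)}{d\omega} \frac{\hbar\omega}{\exp(\hbar\omega/k_B T) - 1} = \frac{\hbar\omega^3}{\pi^2 c^3} \frac{1}{\exp(\hbar\omega/k_B T) - 1} . \tag{3} \]

This is exactly Planck’s radiation formula [see (2.26)]. 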
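b) The total energy in the cavity is given by 

\[ E = \int_0^\infty \frac{dE(\omega)}{d\omega} \, d\omega = \frac{V \hbar}{\pi^2 c^3} \int_0^\infty \frac{\omega^3 \, d\omega}{\exp(\hbar\omega/k_B T) - 1} = \frac{V (k_B T)^4}{\pi^2 c^3 \hbar^3} \int_0^\infty \frac{x^3 \, dx}{e^x - 1} = \frac{V \pi^2 k_B^4}{15 \hbar^3 c^3} \, T^4 \quad \text{(the Stefan-Boltzmann law)} . \tag{4} \]

Then the energy density $dE/dV$ in the cavity is 

\[ \frac{dE}{dV} = \frac{E}{V} = \frac{\pi^2 k_B^4}{15 \hbar^3 c^3} \, T^4 . \tag{5} \]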

This gives rise to homogeneous, isotropic radiation of density $K$, where $K$ is 
given by 

\[ K = \frac{c}{4\pi} \frac{dE}{dV} . \]

The emittance is then 

\[ \varepsilon(T) = \frac{c}{4} \frac{E(T)}{V} = \sigma T^4 , \tag{6} \]

where $\sigma = 5.42 \times 10^{-5}$ erg cm$^{-2}$ s$^{-1}$ K$^{-4}$. Indeed, the total energy emitted 
from a surface element $df$ in the time interval $dt$ in the forward direction (see 
Figure) turns out to be 

\[ dE = \int_0^{\pi/2} \int_0^{2\pi} \sin\theta \, d\theta \, d\varphi \, K (\cos\theta \, df) \, dt = K \, df \, dt \int_0^{\pi/2} \int_0^{2\pi} d\theta \, d\varphi \, \cos\theta \sin\theta = \pi K \, df \, dt . \]

The total power per surface area, the intensity, therefore is given by 

\[ \frac{dE}{df \, dt} = \pi K = \frac{c}{4} \frac{dE}{dV} = \sigma T^4 . \tag{7} \]
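Two ingredients of this solution lend themselves to a numerical check: the value $\pi^4/15$ of the integral in the hint, and the factor of proportionality $\sigma = \pi^2 k_B^4 / (60 \hbar^3 c^2)$ that follows from (5) and (6). The Python sketch below is illustrative only; the CGS constants are assumed standard values rather than numbers from the text, and the function names are hypothetical.

```python
import math

HBAR = 1.054572e-27      # erg s (assumed CGS value)
K_B = 1.380649e-16       # erg/K (assumed CGS value)
C_LIGHT = 2.99792458e10  # cm/s (assumed CGS value)


def bose_integral(upper: float = 60.0, steps: int = 200_000) -> float:
    """Midpoint-rule estimate of the integral of x^3 / (e^x - 1) from 0 to `upper`.

    The integrand decays like x^3 * exp(-x), so the tail beyond x = 60 is negligible.
    """
    h = upper / steps
    total = 0.0
    for i in range(steps):
        x = (i + 0.5) * h
        total += x**3 / math.expm1(x)
    return total * h


def stefan_boltzmann_constant() -> float:
    """sigma = (c/4) * pi^2 k_B^4 / (15 hbar^3 c^3) in erg cm^-2 s^-1 K^-4."""
    return math.pi**2 * K_B**4 / (60 * HBAR**3 * C_LIGHT**2)


integral = bose_integral()
sigma = stefan_boltzmann_constant()
print(integral, math.pi**4 / 15, sigma)
```

With these assumed constants the expression for $\sigma$ evaluates to about $5.67 \times 10^{-5}$ erg cm$^{-2}$ s$^{-1}$ K$^{-4}$.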
The radiation of stars is approximately described by black-body radiation. 
Therefore the Stefan-Boltzmann law can be used to estimate the stellar surface 
temperature if we measure the radiation energy per cm² perpendicular to the 
direction of emission. 
For example, consider the sun. Its radius $R$ is $0.7 \times 10^{11}$ cm. Therefore the 
total emitted radiation energy is given by 

\[ 4\pi R^2 \sigma T^4 = 4\pi (0.7 \times 10^{11})^2 (5.42 \times 10^{-5}) \, T^4 = 3.34 \times 10^{18} \, T^4 \ \text{erg s}^{-1} \quad (T \text{ in Kelvin}) . \tag{8} \]

The radiation energy hitting 1 cm² of the Earth’s surface in one second, taking 
the mean Sun-Earth distance to be $1.5 \times 10^{13}$ cm, is calculated thus: 

\[ \frac{3.34 \times 10^{18} \, T^4}{4\pi (1.5 \times 10^{13})^2} \ \frac{\text{erg}}{\text{s cm}^2} = 0.96 \times 10^{-9} \, T^4 \ \frac{\text{erg}}{\text{s cm}^2} . \tag{9} \]

This quantity is called the solar constant and is determined experimentally. Its 
measured value is 1.94 cal cm$^{-2}$ min$^{-1}$. Transformation of units, 

\[ 1.94 \ \frac{\text{cal}}{\text{cm}^2 \, \text{min}} = \frac{1.94 \times 4.2 \times 10^7 \ \text{erg}}{60 \ \text{cm}^2 \, \text{s}} = 1.36 \times 10^6 \ \text{erg cm}^{-2} \, \text{s}^{-1} , \tag{10} \]

easily gives 

\[ T^4 = 1.52 \times 10^{15} \ \text{K}^4 \quad \text{or} \quad T \approx 6000 \ \text{K} . \tag{11} \]


EXERCISE 


2.4 Wien’s Displacement Law 


Problem. Derive Wien’s displacement law, i.e. 

\[ \lambda_{\max} T = \text{const} , \]

from Planck’s spectral energy density $\frac{1}{V} \frac{dE}{d\omega}$. Here $\lambda_{\max}$ is that wavelength 
where $\frac{1}{V} \frac{dE}{d\omega}$ achieves its maximum. Interpret the results. 


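Solution. We are looking for the maximum of Planck’s spectral distribution: 

\[ \frac{d}{d\omega} \left[ \frac{1}{V} \frac{dE}{d\omega} \right] = \frac{d}{d\omega} \left\{ \frac{\hbar\omega^3}{\pi^2 c^3} \left[ \exp\left( \frac{\hbar\omega}{k_B T} \right) - 1 \right]^{-1} \right\} \]

\[ = \frac{3\hbar\omega^2}{\pi^2 c^3} \frac{1}{\exp(\hbar\omega/k_B T) - 1} - \frac{\hbar\omega^3}{\pi^2 c^3} \frac{\hbar}{k_B T} \frac{\exp(\hbar\omega/k_B T)}{[\exp(\hbar\omega/k_B T) - 1]^2} \]

\[ = \frac{\hbar\omega^2}{\pi^2 c^3} \frac{1}{\exp(\hbar\omega/k_B T) - 1} \left[ 3 - \frac{\hbar\omega}{k_B T} \exp\left( \frac{\hbar\omega}{k_B T} \right) \left[ \exp\left( \frac{\hbar\omega}{k_B T} \right) - 1 \right]^{-1} \right] = 0 . \tag{1} \]

With the shorthand notation $x = \hbar\omega/k_B T$, we get the transcendental equation 

\[ e^{-x} = 1 - \frac{x}{3} , \tag{2} \]

which must be solved graphically or numerically. Besides the trivial solution 
$x = 0$ (minimum), a positive solution exists (see Figure). Therefore 

\[ x_{\max} = \frac{\hbar\omega_{\max}}{k_B T} = 2.82 , \tag{3} \]

and because $\omega_{\max} = 2\pi\nu_{\max} = 2\pi c/\lambda_{\max}$, we have 

\[ \lambda_{\max} T = \text{const} = 0.29 \ \text{cm K} . \tag{4} \]

Figure: The crossing points of the two curves yield the solution to (2) 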


This means the wavelength emitted most intensely by a black body is inversely 
proportional to the temperature of the body. $\lambda_{\max}$ shifts (“displaces”) with the 
temperature; this is why it is called a “displacement law”. This law may also be 
used to determine the temperature of bodies (stars). 
Inserting the solar surface temperature $T \approx 6000$ K (see Exercise 2.3) into 
Wien’s displacement law, we find 
$$\lambda_{\max} = \frac{0.29}{6000}\ \mathrm{cm} = 4.8\times10^{-5}\ \mathrm{cm} = 4800\ \text{Å} , \tag{5}$$
where 1 Å = $10^{-8}$ cm. This is approximately the wavelength of yellow light. 
These estimates are within 20% of the exact values. 


EXERCISE 


2.5 Emitted Energies of a Black Body 

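Problem. Calculate the proportion of energy emitted by a black body radiator 
at T = 2000 K in two bands of width 100 Å, one centred at 5000 Å (visible light) 
and the other at 50000 Å (infrared). 


Solution. We define $\lambda_1 = 5000$ Å, $\lambda_2 = 50000$ Å, $\Delta\lambda = 50$ Å, and calculate 
the ratio of the energy in the infrared band to that in the visible band: 
$$W = \frac{\Delta E_2}{\Delta E_1}
 = \frac{\displaystyle\int_{\lambda_2-\Delta\lambda}^{\lambda_2+\Delta\lambda}\frac{1}{V}\left|\frac{dE}{d\lambda}\right| d\lambda}
        {\displaystyle\int_{\lambda_1-\Delta\lambda}^{\lambda_1+\Delta\lambda}\frac{1}{V}\left|\frac{dE}{d\lambda}\right| d\lambda}
 \approx \frac{\left|dE/d\lambda\right|_{\lambda_2}}{\left|dE/d\lambda\right|_{\lambda_1}} . \tag{1}$$
Because $\omega = 2\pi c/\lambda$, we get 
$$\frac{1}{V}\left|\frac{dE}{d\lambda}\right|
 = \frac{1}{V}\frac{dE}{d\omega}\left|\frac{d\omega}{d\lambda}\right|
 = \frac{\hbar(2\pi c/\lambda)^3}{\pi^2c^3}\left[\exp\left(\frac{2\pi\hbar c}{k_BT\lambda}\right)-1\right]^{-1}\frac{2\pi c}{\lambda^2}
 = \frac{8\pi hc}{\lambda^5}\left[\exp\left(\frac{hc}{k_BT\lambda}\right)-1\right]^{-1} , \tag{2}$$
and with $hc = 12400$ eV Å, $k_B = 8.62\times10^{-5}$ eV K⁻¹, it follows that $W = 5.50$. 
Thus, only a small fraction of energy is emitted as visible light. 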
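
As a numerical cross-check, the sketch below evaluates the Planck distribution per unit wavelength, $\lambda^{-5}[\exp(hc/k_BT\lambda) - 1]^{-1}$, at the two band centres and forms the ratio of the infrared to the visible value. The common prefactor $8\pi hc$ cancels and is omitted. The function and variable names are assumptions made for this illustration; the constants are those quoted in the exercise.

```python
import math

HC_EV_ANGSTROM = 12400.0   # h*c in eV * Angstrom, as quoted in the exercise
K_B_EV_PER_K = 8.62e-5     # Boltzmann constant in eV/K, as quoted in the exercise


def planck_per_wavelength(lam_angstrom: float, temperature_k: float) -> float:
    """Planck energy density per unit wavelength, without the factor 8*pi*h*c."""
    x = HC_EV_ANGSTROM / (K_B_EV_PER_K * temperature_k * lam_angstrom)
    return lam_angstrom ** -5 / math.expm1(x)   # expm1(x) = exp(x) - 1


T = 2000.0
visible = planck_per_wavelength(5000.0, T)
infrared = planck_per_wavelength(50000.0, T)
ratio_ir_to_visible = infrared / visible
print(f"infrared / visible = {ratio_ir_to_visible:.2f}")
print(f"visible / infrared = {1.0 / ratio_ir_to_visible:.3f}")
```

With these constants the value 5.50 quoted in the text comes out as the infrared-to-visible ratio; its reciprocal, about 0.18, is the visible-to-infrared ratio.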




2.6 Cosmic Black Body Radiation 


During the last decade, black body radiation has gained special importance. In 
the late 1940s, George Gamow, first by himself, but later with R. Alpher and H. 
Bethe, investigated some consequences of the “Big Bang Model” of the creation 
of the Universe. One of those consequences was that the remnants of the intense 
radiation field created in the beginning should be present as a black body 
radiation field. Calculations predicting such a radiation field at a temperature of 
25 K proved unreliable. Until 1964, no attempts had been made to measure this 
radiation. Then A.A. Penzias and R.W. Wilson discovered strong thermal noise 
with their radio-astronomical detector, and renewed interest in this problem 
arose. A group under the leadership of R.H. Dicke, consisting of P.J. Peebles, 
P.G. Roll and D.T. Wilkinson, performed measurements of cosmic background 
radiation and understood at once the meaning of the thermal noise. It corresponds 
to black body radiation at a temperature which today we believe to be 2.65 ± 0.09 K. 
The measurements were not easy to make, because any antenna is deluged by 
signals from the Earth’s surface, from the atmosphere, from several cosmic point 
sources and also by the noise generated by the electrical circuits in the measur- 
ing apparatus. In 1945, Dicke had constructed an instrument to measure 
radiation that could be used for these experiments. His idea was to construct 
a radio receiver that switches back and forth between the sky and a bath of 
liquid helium, 100 times a second. The receiver’s output signal is filtered: only 
signals varying with a frequency of 100 Hz are measured. These represent the 
difference between radiation from space and liquid helium. By varying the 
measuring apparatus, the atmospheric component could be separated out. 

The experimental verification of radiation at a temperature corresponding to 
the calculations made by Dicke and his collaborators is one of the strongest 
arguments in support of the “Big Bang Model” (see Figure below). More precise 
measurements will determine how fast we (Earth, solar system, local group of 
galaxies) move relative to the background radiation. Currently, our velocity 
relative to the radiation field is less than 300 km s⁻¹, roughly corresponding to 
the velocity of the solar system with respect to the local group of galaxies, caused 
by the rotation of our own galaxy. 


2.5 Biographical Notes 


RAYLEIGH, John William Strutt, 3rd Baron R., English physicist, *Langford Grove 
(Essex) 12.11.1842, †Terling Place (Chelmsford) 30.6.1919, was a professor at the Cavendish 
Laboratory in Cambridge from 1879-84, at the Royal Society in London from 1884 
to 1905. R. investigated, among other things, the intensity of sound by measuring sound 
pressure exerted on an easily movable plate (Rayleigh disc), deduced that the blue colour of 
the sky is caused by the scattering of light by the molecules of the air (Rayleigh 
scattering) and postulated a radiation law in 1900, known as the R.-Jeans law, representing 
a special case of Planck’s law. Discrepancies in the measurements of the sound 
velocity of nitrogen led him and W. Ramsay to the discovery of argon in 1894, which was 
rewarded with the Nobel Prize in Physics and Chemistry in 1904 [BR]. 

[Figure: Measurements of the background radiation in erg s⁻¹ cm⁻² steradian⁻¹ Hz⁻¹ as 
a function of wavelength in cm. The drawn curve is the predicted spectrum for …]


JEANS, James Hopwood, Sir, English mathematician, physicist and astronomer, 
*Southport 11.9.1877, †Dorking 16.9.1946, was a professor of astronomy at the Royal 
Society from 1912-1946. J. did pioneering work mainly in the fields of thermodynamics, 
stellar dynamics and cosmogony. He wrote natural philosophical works and also fasci- 
nating popular astronomical books [BR]. 


PLANCK, Max, German physicist, *Kiel 23.4.1858, †Göttingen 4.10.1947, received his 
doctor’s degree at the age of 21 after submitting a thesis on thermodynamics. In 1885 he 
became a professor in Kiel; in 1889 he was a professor of theoretical physics and 
continued to work long after his retirement. During his studies on entropy in 1894 P. 
devoted himself to thermal radiation. In doing so, he discovered (not later than May, 
1899), while still of the opinion that Wien’s radiation formula was correct, a new constant 
of nature, Planck's quantum of action. In mid-October, 1900, he deduced his radiation 
formula by an ingenious interpolation, which turned out to be the correct law of 
black-body radiation. The 14th of December, 1900, when P. reported on the derivation of 
this formula from the principles of physics at the meeting of the Deutsche Physikalische 
Gesellschaft in Berlin, is considered as the “birthday of quantum theory”. While P. 
remained sceptical concerning Einstein’s light quantum hypothesis, he immediately 
recognized the importance of the theory of special relativity established by Einstein in 
1905; it is due mainly to P. that it was so quickly accepted in Germany. In 1918 he was 
awarded the Nobel Prize in Physics. Because of his scientific works and straightforward 
and uncompromising character, and because of his gentlemanly behaviour, he occupied 
a unique position among German physicists. As one of the four permanent secretaries, he 
directed the Preußische Akademie der Wissenschaften for more than twenty-five years. 
For many years he was president of the Deutsche Physikalische Gesellschaft and 
copublisher of the “Annalen der Physik”. The Deutsche Physikalische Gesellschaft 
founded the Max Planck medal on his 70th birthday; P. was the first to win this award. 
After the end of the Second World War, the Kaiser Wilhelm Gesellschaft zur Förderung 
der Wissenschaften, of which P. had been president for seven years, was renamed the Max 
Planck Gesellschaft zur Förderung der Wissenschaften e.V. [BR]. 


WIEN, Wilhelm, German physicist, *Gaffken (Ostpreußen) 13.1.1864, †München 
30.8.1928, was a professor in Aachen, Gießen, Würzburg, München; in 1893, still assistant 
of H.v. Helmholtz, he discovered his displacement law: in 1896 he published his important 
(though only approximately valid) radiation law which had already been found. For these 
works (papers) W. was awarded the Nobel Prize (1911); their continuation by M. Planck 
led to quantum theory. In 1896, W. turned to writing about particle beams: among other 
things, he identified the cathode rays as negatively charged particles; he noticed that the 
channel rays consist of a mixture of predominantly positive ions and determined their 
specific charge and velocity. He worked on charge transfer and glow processes, deter- 
mined the mean free path of the particles and the glow time of atoms glowing unper- 
turbed in high vacuum. As editor of “Annalen der Physik” (beginning in 1906) he greatly 
influenced the development of science [BR]. 


3. Wave Aspects of Matter 


3.1 De Broglie Waves 


Investigations on the nature of light showed that, depending on the kind of 
experiment performed, light must be described by electromagnetic waves or by 
particles (photons). Thus the wave aspect appears in the context of diffraction 
and interference phenomena, whereas the particulate aspect shows up most 
distinctly in the photoelectric effect. So for light, the relations describing wave- 
particle duality are already known. But what about material particles? Their 
particulate nature is rather obvious; do they also possess a wave aspect? 

In addition to their corpuscular properties, de Broglie assigned wave properties 
to particles, thus transferring the relations known from light to matter. 
What is true for photons should be valid for any type of particle. Hence, 
according to the particulate picture, we assign to a particle, for example an 
electron with mass m, propagating uniformly with velocity v through field-free 
space, an energy E and a momentum p. In the wave picture, the particle is 
described by a frequency ν and a wave vector k. Following de Broglie, we now 
speculate: since these descriptions should only designate two different aspects of 
the same object, the following relations between the characteristic quantities 
should be valid: 
$$E = h\nu = \hbar\omega \quad\text{and} \tag{3.1}$$
$$\mathbf{p} = \hbar\mathbf{k} = \frac{h}{\lambda}\,\frac{\mathbf{k}}{|\mathbf{k}|} . \tag{3.2}$$


We have seen in the preceding chapters that these equations are true for the 
photon (electromagnetic field); now they are postulated to be valid for all 
particles. Then to every free particle, understood in the above sense, a plane 
wave determined up to an amplitude factor A is assigned: 


$$\psi(\mathbf{r},t) = A\exp[\,i(\mathbf{k}\cdot\mathbf{r}-\omega t)] , \tag{3.3}$$
or, using the above relations, 
$$\psi(\mathbf{r},t) = A\exp[\,i(\mathbf{p}\cdot\mathbf{r}-Et)/\hbar\,] . \tag{3.3a}$$
Following de Broglie, the plane wave connected to the particle has a wavelength 
$$\lambda = \frac{2\pi}{k} = \frac{h}{p} = \frac{h}{mv} , \tag{3.4}$$
where the second relation is valid only for particles with nonvanishing rest mass. 
Because of the small value of the quantum of action, the particle mass must be 
sufficiently small to generate a measurable wavelength. For this reason, the wave 
character of matter first appears in atomic regions. The phase 
$$\alpha = \mathbf{k}\cdot\mathbf{r} - \omega t \tag{3.5}$$
of the wave $\psi(\mathbf{r},t)$ (3.3) propagates with velocity $\mathbf{u} = \dot{\mathbf{r}}$ according to the relation 
$$\frac{d\alpha}{dt} = \mathbf{k}\cdot\mathbf{u} - \omega = 0 . \tag{3.6}$$
Hence, we get for the magnitude of the phase velocity $\mathbf{u}$ ($\mathbf{k}$ and $\mathbf{u}$ have the same 
direction): 
$$|\mathbf{u}| = \frac{\omega}{k} . \tag{3.7}$$


In the following, we will show that matter waves, in contrast with electromagnetic 
waves, show dispersion even in a vacuum. We must therefore calculate 
$\omega(k)$. The relativistic energy theorem for free particles 
$$E^2 = p^2c^2 + m_0^2c^4$$
can, for $v \ll c$, be put into the form 
$$E = mc^2 = \sqrt{m_0^2c^4 + p^2c^2} = m_0c^2 + \frac{p^2}{2m_0} + \dots \tag{3.8}$$
With (3.1) and (3.2) we can give the frequency as a function of the wave number: 
$$\omega(k) = \frac{m_0c^2}{\hbar} + \frac{\hbar k^2}{2m_0} + \dots \tag{3.9}$$
Therefore, the phase velocity $u = \omega/k$ is a function of k in a vacuum, i.e. 
$$u = \frac{\omega}{k} = \frac{m_0c^2}{\hbar k} + \frac{\hbar k}{2m_0} + \dots , \tag{3.10}$$
so that the matter waves show dispersion even there, i.e. waves with a different 
wave number (wavelength) have different phase velocities. On the other hand, 
for the phase velocity u, the following relation holds: 
$$u = \frac{\omega}{k} = \frac{\hbar\omega}{\hbar k} = \frac{E}{p} = \frac{mc^2}{mv} = \frac{c^2}{v} . \tag{3.11}$$

Because c > v, the phase velocity of matter waves is always larger than the 
velocity of light in a vacuum. Hence, it cannot be identified with the velocity of 
the assigned particles. Because these are massive, they can only propagate more 
slowly than light does. 

The group velocity is calculated using 
$$v_g = \frac{d\omega}{dk} = \frac{d(\hbar\omega)}{d(\hbar k)} = \frac{dE}{dp} \tag{3.12}$$
(we shall prove this below). The variation of energy dE of the particle moving 
under the influence of a force $\mathbf{F}$ along a path $d\mathbf{s}$ is $dE = \mathbf{F}\cdot d\mathbf{s}$, and because 
$\mathbf{F} = d\mathbf{p}/dt$ we therefore have 
$$dE = \frac{d\mathbf{p}}{dt}\cdot d\mathbf{s} = d\mathbf{p}\cdot\mathbf{v} . \tag{3.13}$$
Since $\mathbf{v}$ and $\mathbf{p} = m\mathbf{v}$ are parallel, the following equations hold: 
$$dE = |\mathbf{v}|\,|d\mathbf{p}| = v\,dp \quad\text{or}\quad \frac{dE}{dp} = v . \tag{3.14}$$
Hence, the group velocity of a matter wave is identical with the particle velocity, 
i.e. 
$$v_g = v . \tag{3.15}$$
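
The two statements, phase velocity $u = E/p = c^2/v$ and group velocity $v_g = dE/dp = v$, can be checked numerically from the relativistic energy-momentum relation $E = \sqrt{m_0^2c^4 + p^2c^2}$. The sketch below does this with a central finite difference for $dE/dp$; the function names, the step size and the chosen momentum (one tenth of $m_0c$ for an electron) are assumptions made for illustration.

```python
import math

C = 2.998e10      # speed of light in cm/s
M_E = 9.11e-28    # electron rest mass in g


def energy(p: float, m0: float) -> float:
    """Relativistic energy E = sqrt(m0^2 c^4 + p^2 c^2) in erg."""
    return math.sqrt((m0 * C ** 2) ** 2 + (p * C) ** 2)


def velocities(p: float, m0: float, rel_step: float = 1e-4):
    """Return particle velocity v, phase velocity u and group velocity v_g."""
    e = energy(p, m0)
    v = p * C ** 2 / e                      # particle velocity, from p = m v and E = m c^2
    u = e / p                               # phase velocity, u = E / p
    dp = p * rel_step
    v_g = (energy(p + dp, m0) - energy(p - dp, m0)) / (2.0 * dp)   # dE/dp
    return v, u, v_g


p = 0.1 * M_E * C
v, u, v_g = velocities(p, M_E)
print(f"v   / c   = {v / C:.6f}")
print(f"v_g / c   = {v_g / C:.6f}")
print(f"u   / c   = {u / C:.6f}")
print(f"u*v / c^2 = {u * v / C ** 2:.12f}")
```

The finite difference stands in for the analytic derivative only to keep the check independent of the algebra in the text.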
We can also deduce this result in a different way. If we want to describe a particle 
as a spatially limited entity, we cannot describe it by a plane wave (3.3). Instead, 
we try to describe the particle by a finite wave packet, which, with the help of 
a Fourier integral, is written as a superposition of harmonic waves, differing in 
wavelength and phase velocity. For simplicity we investigate a group of waves 
propagating in the x direction 
$$\psi(x,t) = \int_{k_0-\Delta k}^{k_0+\Delta k} c(k)\exp\{i[kx-\omega(k)t]\}\,dk . \tag{3.16}$$
Here, $k_0 = 2\pi/\lambda_0$ is the mean wave number of the group and $\Delta k$ is the measure of 
the extension (frequency spread) of the wave packet, assumed to be small 
($\Delta k \ll k_0$). Therefore, we can expand the frequency $\omega$, which according to (3.9) is 
a function of k, in a Taylor series in the interval $\Delta k$ about $k_0$, and neglect terms 
of the order $(\Delta k)^n = (k-k_0)^n$, $n \ge 2$, i.e. 
$$\omega(k) = \omega(k_0) + \left(\frac{d\omega}{dk}\right)_{k=k_0}(k-k_0) + \frac{1}{2}\left(\frac{d^2\omega}{dk^2}\right)_{k=k_0}(k-k_0)^2 + \dots \tag{3.17}$$
We take $\xi = k - k_0$ as the new integration variable and assume the amplitude $c(k)$ 
to be a slowly varying function of k in the integration interval $2\Delta k$. The term 
$(d\omega/dk)_{k=k_0} = v_g$ is the group velocity. Thus, (3.16) becomes 
$$\psi(x,t) = \exp\{i[k_0x-\omega(k_0)t]\}\int_{-\Delta k}^{\Delta k}\exp[\,i(x-v_gt)\xi\,]\,c(k_0+\xi)\,d\xi . \tag{3.18}$$
Integration, transformation and the approximation $c(k_0+\xi) \approx c(k_0)$ lead to 
the result 
$$\psi(x,t) = C(x,t)\exp\{i[k_0x-\omega(k_0)t]\} \quad\text{with} \tag{3.19a}$$
$$C(x,t) = 2c(k_0)\,\frac{\sin[\Delta k(x-v_gt)]}{x-v_gt} . \tag{3.19b}$$
Fig. 3.1. A wave packet: several rapidly oscillating waves superpose, thus 
generating a group with finite extent 


Since the argument of the sine contains the small quantity $\Delta k$, $C(x,t)$ varies only 
slowly with time t and coordinate x. Therefore, we can regard $C(x,t)$ as 
the amplitude of an approximately monochromatic wave and $k_0x - \omega(k_0)t$ as 
its phase. Multiplying numerator and denominator of the amplitude by $\Delta k$ and 
abbreviating the term 
$$\Delta k\,(x - v_gt) = z ,$$
we see that variation in amplitude is determined by the factor 
$$\frac{\sin z}{z} .$$
This has the properties 
$$\lim_{z\to0}\frac{\sin z}{z} = 1 \quad\text{for } z = 0 , \qquad \frac{\sin z}{z} = 0 \quad\text{for } z = \pm\pi, \pm2\pi, \dots \tag{3.20}$$


If we further increase the absolute value of z, the function (sin z)/z runs 
alternately through maxima and minima, the function values of which are small 
compared with the principal maximum at z = 0, and quickly converges to zero. 
Therefore, we can conclude that superposition generates a wave packet whose 
amplitude is non-zero only in a finite region, and is described by (sin z)/z. Figure 
3.1 illustrates the form of such a wave packet at a certain time. 

The modulating factor $(\sin z)/z$ of the amplitude assumes for $z \to 0$ the 
maximal value 1. Therefore, for $z = 0$, 
$$v_gt - x = 0 ,$$
which means that the maximum of the amplitude is a plane, propagating with 
velocity 
$$\frac{dx}{dt} = v_g . \tag{3.21}$$


The propagation velocity of the plane of maximal amplitude has to be identified 
with the group velocity v,, which, as we determined earlier, is the velocity of 
energy transport. The group velocity is the velocity of the whole wave packet 
(“matter-wave” group). 
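
The envelope result can be illustrated numerically. The sketch below evaluates the wave-packet integral $\int_{k_0-\Delta k}^{k_0+\Delta k} c(k)\exp\{i[kx-\omega(k)t]\}\,dk$ by a midpoint sum, under the same simplifications as in the text (constant $c(k)$, frequency expanded to first order about $k_0$), and compares its modulus with the closed form $2c(k_0)\sin[\Delta k(x-v_gt)]/(x-v_gt)$. All numerical values and function names are arbitrary assumptions chosen for illustration, in dimensionless units.

```python
import cmath
import math


def packet(x, t, k0, dk, v_g, omega0, c0=1.0, steps=2000):
    """Midpoint-rule value of the wave-packet integral with constant c(k) = c0
    and linearised dispersion omega(k) = omega0 + v_g * (k - k0)."""
    h = 2.0 * dk / steps
    total = 0j
    for j in range(steps):
        k = k0 - dk + (j + 0.5) * h
        omega = omega0 + v_g * (k - k0)
        total += c0 * cmath.exp(1j * (k * x - omega * t)) * h
    return total


def envelope(x, t, dk, v_g, c0=1.0):
    """Closed-form amplitude 2 c0 sin[dk (x - v_g t)] / (x - v_g t)."""
    s = x - v_g * t
    return 2.0 * c0 * dk if s == 0 else 2.0 * c0 * math.sin(dk * s) / s


K0, DK, V_G, OMEGA0, T0 = 10.0, 0.5, 2.0, 5.0, 1.0
for x in (2.0, 3.0, 8.0):
    numeric = abs(packet(x, T0, K0, DK, V_G, OMEGA0))
    analytic = abs(envelope(x, T0, DK, V_G))
    print(f"x = {x:4.1f}   |psi| = {numeric:.6f}   |C| = {analytic:.6f}")
```

At $x = v_g t$ (here x = 2) the amplitude takes its maximal value $2c(k_0)\Delta k$, which is the point that travels with the group velocity.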

We can understand this in another, shorter way: if we demand $|\psi(x,t)|^2$ 
in (3.18) to be constant, i.e. $|\psi(x,t)|^2 = \mathrm{const.}$, we conclude from (3.18) that 
$v_gt - x = \mathrm{const.}$, and, hence, by differentiation, $\dot{x} = v_g$. Thus, the fixed constant 
value of $|\psi(x,t)|^2$ moves with the group velocity $v_g$. Differentiating the dispersion 
relation (3.9) of $\omega(k)$, we get for $v_g$: 
$$v_g = \left(\frac{d\omega}{dk}\right)_{k=k_0} = \frac{\hbar k_0}{m_0} = \frac{p}{m_0} = v . \tag{3.22}$$
From this, we must not in general conclude that the group velocity of a matter-wave 
group coincides with the classical particle velocity. All results derived up to 
now were gained under the simplification that all terms in the expansion (3.17) of 
$\omega(k)$ higher than first order can be neglected. This is allowed so long as the 
medium is free of dispersion. Since de Broglie waves show dispersion even in 
a vacuum, the derivative $d^2\omega/dk^2$ is non-zero. This implies that the 
wave packet does not retain its form, but gradually spreads (each of the many 
monochromatic waves forming the packet has a slightly different frequency and 
therefore a different propagation velocity). If the dispersion is small, i.e. 
$$\frac{d^2\omega}{dk^2} \approx 0 , \tag{3.23}$$
for a certain time, we can assign a particular form to the wave packet. Then we 
can consider the matter-wave group as moving as a whole with the group 
velocity $v_g$. 

Following de Broglie, we assign to each uniformly moving particle a plane 
wave with wavelength $\lambda$. To determine this wavelength, we start with de 
Broglie’s basic equations, (3.1) and (3.2). For the wavelength, the following holds: 
$$\lambda = \frac{2\pi}{k} = \frac{2\pi\hbar}{p} = \frac{h}{p} . \tag{3.24}$$
If we assume the velocity of the particle to be small, $v \ll c$, and use the equation 
$$E = \frac{p^2}{2m_0} ,$$
we get the wavelength 
$$\lambda = \frac{h}{\sqrt{2m_0E}} , \tag{3.25}$$
meaning that we must know the rest mass of the particle in motion to determine 
its wavelength. If we consider, for example, an electron with kinetic energy 
E = 10 keV and rest mass $m_0 = 9.1\times10^{-28}$ g, then its matter wavelength is 
$\lambda = 0.122$ Å $= 0.122\times10^{-8}$ cm. 
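
The numerical example can be reproduced in a few lines. This is a minimal sketch of the nonrelativistic formula $\lambda = h/\sqrt{2m_0E}$ in CGS units; the function name and the constant values (standard rounded values) are assumptions for illustration.

```python
import math

H = 6.626e-27     # Planck's constant in erg s
M_E = 9.11e-28    # electron rest mass in g
EV = 1.602e-12    # 1 eV in erg


def de_broglie_angstrom(kinetic_ev: float, rest_mass_g: float) -> float:
    """Nonrelativistic de Broglie wavelength h / sqrt(2 m0 E), returned in Angstrom."""
    momentum = math.sqrt(2.0 * rest_mass_g * kinetic_ev * EV)   # g cm / s
    return H / momentum * 1.0e8                                 # cm -> Angstrom


lam = de_broglie_angstrom(10.0e3, M_E)
print(f"lambda(10 keV electron) = {lam:.4f} Angstrom")
```

At 10 keV the electron is still only mildly relativistic, so the nonrelativistic formula agrees with the quoted value to the digits given.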

As we know, the resolution of a small object under a microscope depends 
upon the wavelength of the light used to illuminate the object (see Examples 3.7 
and 3.8 later on). The shorter the wavelength, the shorter the distance is between 
two points that can be seen distinctly through the microscope. The wavelength 
of visible light can typically be chosen as $\lambda = 5000$ Å, permitting a magnification 
of about 2000 times. If electrons are used instead of visible light to “scan” an 
object, magnification of up to 500000 times and a resolution of about 5-10 Å 
can be achieved. Finally, protons and mesons in the GeV region ($10^9$ eV) have 
wavelengths so small that it is possible to use them to investigate the inner 
structure of elementary particles. 


3.2 The Diffraction of Matter Waves 


Interference and diffraction phenomena are unique proofs of the occurrence of 
waves. In particular, destructive interference cannot be explained using the 
corpuscular picture. While the photoelectric effect and the Compton effect 
show the corpuscular nature of light, the diffraction of electron rays proves the 
existence of matter waves. 

Fig. 3.2. The principle of scattering matter waves at crystals 

Fig. 3.3. Scattering of a matter wave at a crystal 

Since the wavelength of electrons is too small for diffraction by an artificial 
grid, crystal lattices are used for scattering. These experiments are in general 
a repetition of the corresponding structural investigations performed with 
X-rays. 

Davisson and Germer applied Laue’s method for X-ray diffraction. Here, the 
surface of a monocrystal is used as a plane diffraction grid. The electrons are 
scattered at the surface of the crystal, but do not penetrate it. Figure 3.2 shows 
the experimental setup and the path of the electron rays. 

As can be seen in the figure, diffraction maxima appear if the condition 
$$n\lambda = d\sin\theta \tag{3.26}$$
is fulfilled. If the electron passes through an accelerating voltage U, its energy is 
given by eU, and from (3.25) it follows that 
$$\frac{nh}{d\sqrt{2m_0e}} = \sqrt{U}\,\sin\theta , \tag{3.27}$$


which, indeed, is confirmed by experiment. 

Tartakowski and Thomson correspondingly used the Debye-Scherrer 
method of X-ray scattering (Exercise 3.1). Here, monochromatic X-radiation is 
diffracted by a body consisting of compressed crystal powder. The crystal 
powder represents a spatial diffraction grid. Because of the disordered arrange- 
ment, there are always crystals that comply with the bending condition. In 
Fig. 3.3 we can see the path of the rays. 

The diffraction maxima appear under the condition (the Wulf-Bragg 
relation) 
$$2d\sin\theta = n\lambda . \tag{3.28}$$
Owing to the statistical distribution of the minicrystals in the crystal powder, the 
apparatus, and correspondingly the diffraction figures, are symmetrical with 
respect to the SO axis. Because of this radial symmetry of the interference 
patterns, circles appear around O on the screen. Obviously, the relation 
$\tan(2\theta) = D/2L$ is valid, where L is the distance between scatterer and screen. 
The experimental setup is chosen in such a way that all angles are small, thus 
permitting the approximation $\tan(2\theta) \approx 2\theta$. From the Wulf-Bragg equation 
we get 
$$D = \frac{2nL\lambda}{d} . \tag{3.29}$$
If electron rays are used, we insert the de Broglie wavelength (3.25) into the 
above relation and find that 
$$D\sqrt{U} = \frac{2nLh}{d\sqrt{2m_0e}} , \tag{3.30}$$
i.e. the square root of the accelerating voltage times the radius of the diffraction 
circles has to be constant for any order of diffraction. 
The experimental results were in perfect agreement with this formula. Now- 
adays, electron rays and, in particular, neutron rays are an important tool used 
in solid state physics to determine crystal structures. 
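
The statement that the ring size times $\sqrt{U}$ is constant can be illustrated with a short calculation based on the small-angle Bragg relation $D = 2nL\lambda/d$ and the nonrelativistic wavelength $\lambda = h/\sqrt{2m_0eU}$. In the sketch below the lattice-plane spacing d = 2 Å and the screen distance L = 30 cm are purely hypothetical values chosen for illustration, as are the function names.

```python
import math

H = 6.62e-27      # Planck's constant in erg s
M_E = 9.11e-28    # electron rest mass in g
EV = 1.602e-12    # 1 eV in erg


def ring_size_cm(voltage_v, order=1, d_cm=2.0e-8, screen_cm=30.0):
    """Ring size D = 2 n L lambda / d in the small-angle approximation.

    d_cm (lattice-plane spacing) and screen_cm (distance L) are assumed values.
    """
    wavelength = H / math.sqrt(2.0 * M_E * voltage_v * EV)   # kinetic energy E = eU
    return 2.0 * order * screen_cm * wavelength / d_cm


voltages = [10.0e3, 20.0e3, 40.0e3]    # accelerating voltages in volts
products = [ring_size_cm(u) * math.sqrt(u) for u in voltages]
for u, prod in zip(voltages, products):
    print(f"U = {u:7.0f} V   D = {ring_size_cm(u):.3f} cm   D*sqrt(U) = {prod:.2f}")
```

Doubling the voltage shrinks each ring by a factor $\sqrt{2}$, which is the experimental signature of the de Broglie relation in this setup.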
EXERCISE 
3.1 Diffraction Patterns Generated by Monochromatic X-rays 
Problem. (a) What are (schematically) the diffraction patterns generated by 
a monochromatic X-ray on an ideal crystal? 
b) The Debye-Scherrer method uses crystal powder rather than a crystal. 
What do these interference patterns look like? 


Solution. a) An ideal crystal consists of completely regularly arranged atoms. 
Incoming radiation of wavelength $\lambda$ is reflected a little by each of these many 
grid planes. Macroscopic reflection occurs only if the reflected rays of several 
parallel net planes interfere constructively. Let d be the distance between any two 
parallel planes; then reflection occurs only if Bragg’s condition $2d\sin\theta = n\lambda$ (n 
integer) is satisfied. Here $\theta$ is the angle between the ray and the grid plane. Since 
we assume that we have an ideal crystal, the angle $\theta$ is determined by the 
orientation of the crystal with regard to the ray direction. In the Bragg condition, 
d, $\theta$ and $\lambda$ are already determined. Hence there is in general no n that fulfils 
the condition. Normally, no reflection occurs! 

[Figure: Scattering by silicon: photograph and diffractometer plot generated by the 
Debye-Scherrer powder method] 

To overcome this drawback, one avoids monochromatic X-rays for structure 
analysis, but uses a continuous spectrum (Laue method) instead. In this case the 
diffraction patterns consist of single points which are regularly arranged. 
Another possibility is to change $\theta$ by turning the crystal (rotating crystal 
method). 

b) Here, monochromatic radiation is scattered by crystal powder. Therefore 
(see above) for most crystals no reflection occurs. Only for those crystals 
oriented accidentally in one of the scattering angles $\theta$ does constructive 
interference occur, and the ray is deflected through $2\theta$. Because the crystals are 
uniformly distributed with respect to the azimuth, a cone of reflection results 
(see next Figure). 


3.2 Scattering of Electrons and Neutrons 


Problem. a) Calculate the wavelengths of the (electromagnetic, probability, 
matter) waves of 10 keV X-rays, 1 keV electrons and 5 eV neutrons. 

b) Do the interference patterns change if X-radiation is replaced by neutrons 
of the same wavelength? 
Additional question: How can “monochromatic” neutrons be created? 


Solution. a) The wavelength $\lambda = 2\pi/k$ of a particle of mass m is related to its 
momentum p; the latter is determined by the total energy E using 
$$p = \left[(E/c)^2 - m_0^2c^2\right]^{1/2} .$$
From the de Broglie relation $k = p/\hbar$, we therefore have 
$$\lambda = 2\pi\hbar\left[(E/c)^2 - m_0^2c^2\right]^{-1/2} .$$
For photons $m_0 = 0$ and therefore $\lambda_{\rm ph} = 2\pi\hbar c/E_{\rm ph}$. In the nonrelativistic limit, 
we get for electrons and neutrons 
$$p = \sqrt{2m_0E_{\rm kin}} , \quad\text{and thus}\quad \lambda = \frac{2\pi\hbar}{\sqrt{2m_0E_{\rm kin}}} .$$
Inserting this and using 

$m_e = 0.911\times10^{-27}$ g , $m_n = 1.675\times10^{-24}$ g , 

1 eV $= 1.602\times10^{-12}$ erg , $2\pi\hbar = 6.62\times10^{-27}$ erg s , 

1 pm $= 10^{-12}$ m $= 10^{-10}$ cm $= 10^{-2}$ Å , 

we have 

$\lambda_{\rm ph}$(10 keV) = 120 pm , $\lambda_e$(1 keV) = 39 pm , $\lambda_n$(5 eV) = 13 pm . 
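
These three wavelengths can be recomputed directly from the formulas of part a). The sketch below uses the constants quoted in the solution together with $c = 2.998\times10^{10}$ cm/s, which is an added assumption (the text does not list it); the function names are likewise chosen only for illustration.

```python
import math

H = 6.62e-27       # 2*pi*hbar in erg s, as quoted in the solution
C = 2.998e10       # speed of light in cm/s (standard value, not quoted in the text)
EV = 1.602e-12     # 1 eV in erg
M_E = 0.911e-27    # electron rest mass in g
M_N = 1.675e-24    # neutron rest mass in g
CM_TO_PM = 1.0e10  # 1 pm = 1e-10 cm


def photon_wavelength_pm(energy_ev: float) -> float:
    """lambda = 2*pi*hbar*c / E for a massless particle."""
    return H * C / (energy_ev * EV) * CM_TO_PM


def matter_wavelength_pm(kinetic_ev: float, rest_mass_g: float) -> float:
    """Nonrelativistic lambda = 2*pi*hbar / sqrt(2 m0 E_kin)."""
    return H / math.sqrt(2.0 * rest_mass_g * kinetic_ev * EV) * CM_TO_PM


lam_photon = photon_wavelength_pm(10.0e3)
lam_electron = matter_wavelength_pm(1.0e3, M_E)
lam_neutron = matter_wavelength_pm(5.0, M_N)
print(f"photon   10 keV: {lam_photon:6.1f} pm")
print(f"electron  1 keV: {lam_electron:6.1f} pm")
print(f"neutron   5 eV : {lam_neutron:6.1f} pm")
```

The results agree with the rounded values quoted above to within a few per cent.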
b) Replacing the X-radiation by neutrons of the same wavelength leaves the 
scattering angles unaltered at first. But because the X-rays interact with the 
electron distribution, whereas the neutrons interact with the atomic nuclei and, 
where present, with magnetic dipole moments, the relative intensities of the 
reflections are different. 

c) To create monochromatic neutrons, Bragg reflection is again used. 
A polychromatic neutron ray strikes a monochromator crystal. At a certain 
angle $2\theta$, only a few wavelengths ($n\lambda = 2d\sin\theta$) are emitted (see Figure). In 
general one reflection (n = 1 and one fixed d) is much stronger than the others. 


3.3 The Statistical Interpretation of Matter Waves 


The question of how to interpret a wave describing a particle, and whether this 
wave should be assigned physical reality, was subject to discussion in the first 
years of quantum mechanics. A single electron acts as a particle, but interference 
patterns only arise if many electrons are scattered. 

Max Born paved the way for the statistical interpretation of the wave 
function describing a particle. He created the term guiding field (in German: 
“Führungsfeld”) as an interpretation of wave functions. The idea actually 
originated with Einstein, who called it a ghost field (“Gespensterfeld”). The 
guiding field is a scalar function $\psi$ of the coordinates of all particles and of time. 
According to the basic idea, the motion of a particle is determined only by the 
laws of energy and momentum conservation and by the boundary conditions 
which depend on the particular experiment (apparatus). The particle is kept 
within these set limits by the guiding field. The probability that the particle will 
follow a particular path is given by the intensity, i.e. the absolute square of the 
guiding field. In the case of electron scattering, this means that the intensity of 
the matter wave (guiding field) determines at every point the probability 
of finding an electron there. We shall now further investigate this interpretation 
of matter waves as probability fields. 

[Figure, Exercise 3.2: Wavelength as function of particle energy for photons [keV], 
neutrons [0.01 eV] and electrons [100 eV]] 

[Figure, Exercise 3.2: Principle of the creation of monochromatic neutrons by 
reflection (incident radiation, crystal)] 

The square of the amplitude of the wave function $\psi$ is the intensity. It ought 
to determine the probability of finding a particle at a certain place. Since $\psi$ may 
be complex, whereas the probability is always real, we do not define $\psi^2$ as 
a measure of intensity, but instead 
$$|\psi|^2 = \psi\psi^* , \tag{3.31}$$
where $\psi^*$ is the complex conjugate of $\psi$. In addition, the probability of finding 
a particle is proportional to the size of the volume under consideration. Let 
$dW(x,y,z,t)$ be the probability of finding a particle in a certain volume element 
$dV = dx\,dy\,dz$ at time t. According to the statistical interpretation of matter 
waves, the following hypothesis is adopted: 
$$dW(x,y,z,t) = |\psi(x,y,z,t)|^2\,dV .$$
In order to get a quantity independent of volume, we introduce the spatial 
probability density 
$$w(x,y,z,t) = \frac{dW}{dV} = |\psi(x,y,z,t)|^2 . \tag{3.32}$$
It is normalized to one, i.e. the amplitude of $\psi$ is chosen so that 
$$\int \psi\psi^*\,dV = 1 . \tag{3.33}$$
This means that the particle must be somewhere in space. The normalization 
integral is time independent: 
$$\frac{d}{dt}\int \psi\psi^*\,dV = 0 ;$$
otherwise, we could not compare probabilities referring to different times. The 
wave function $\psi$ can only be normalized if it is square-integrable, i.e. if the 
improper integral 
$$\int |\psi|^2\,dV \ \text{ converges, i.e. } \int \psi\psi^*\,dV < M ,$$
M being a real constant. The probability interpretation for the $\psi$ field expressed 
in (3.32) is a small step, but nevertheless it is only a hypothesis. Its validity must 
be proved, and will be proved, as we shall see, by the success of the results 
it predicts. A state is bound if the motion of the system is restricted; if it is 
not restricted, we have free states. In the course of this book we shall establish 
the following facts: the wave function $\psi$ for bound states (E < 0) is 
square-integrable, whereas $|\psi|^2$ is not integrable for free states. This is intuitively 
understood from Fig. 3.4: bound states are localized within the potential well 
and can propagate only within its interior; thus, they are confined. Free states 
are located above the potential well and are not bound. 

A normalized wave function $\psi$ is determined only up to a phase factor of 
modulus one, i.e. up to a factor $e^{i\alpha}$ with an arbitrary real number $\alpha$. This lack of 
uniqueness stems from the fact that only the quantity $\psi\psi^* = |\psi|^2$, the probability 
density, has physical relevance. 

An example of wave functions that cannot be normalized according to the 
requirement (3.33) is the wave function 
$$\psi_{\mathbf{k}}(\mathbf{r},t) = N e^{i(\mathbf{k}\cdot\mathbf{r}-\omega t)} , \tag{3.34}$$
where N is a real constant. This plane wave describes the motion of a free 
particle with momentum $\mathbf{p} = \hbar\mathbf{k}$ and undefined locality. But we can normalize 
the function (3.34) if we define all functions within a large, finite volume 
considered to be a cube with an edge length L (box normalization): 
$$\psi_{\mathbf{k}}(\mathbf{r},t) = \begin{cases} N e^{i(\mathbf{k}\cdot\mathbf{r}-\omega t)} & \text{for } \mathbf{r} \text{ within } V = L^3 , \\ 0 & \text{for } \mathbf{r} \text{ outside } V = L^3 . \end{cases} \tag{3.35}$$
Another method for normalizing such “continuum wave functions” will be 
presented in Chap. 5. 

On the surface of this volume, the wave functions must satisfy certain 
boundary conditions. We assume L to be large by microscopic standards 
($L \gg 10^{-8}$ m). Then the influence of the boundary conditions on the motion of 
a particle in the volume $V = L^3$ is very small. Therefore, we can choose the 
boundary conditions in quite a simple, arbitrary form. Very often a periodicity 
with period L is selected as boundary condition; we require that 
$$\psi(x,y,z) = \psi(x+L,y,z) = \psi(x,y+L,z) = \psi(x,y,z+L) . \tag{3.36}$$
Now we determine the normalization factor N in (3.34), keeping in mind the 
definition (3.35): 
$$1 = \int \psi\psi^*\,dV = N^2\int_{V=L^3} dV = N^2L^3 .$$
Hence, it follows that 
$$N = \frac{1}{\sqrt{L^3}} = \frac{1}{\sqrt{V}} .$$
Thus, we get the normalized wave function 
$$\psi_{\mathbf{k}}(\mathbf{r},t) = \frac{1}{\sqrt{V}}\,e^{i\mathbf{k}\cdot\mathbf{r} - i\omega(k)t} = \psi_{\mathbf{k}}(\mathbf{r})\,e^{-i\omega(k)t} . \tag{3.37a}$$

Fig. 3.4. Bound and continuum states of a particle in a potential well 

The boundary conditions of our problem restrict the possible values of the 
vector $\mathbf{k}$: 
$$\mathbf{k} = \frac{2\pi}{L}\,\mathbf{n} , \qquad \mathbf{n} = \{n_x, n_y, n_z\} . \tag{3.38a}$$
Written as components, we have, respectively, 
$$k_x = \frac{2\pi}{L}\,n_x , \qquad k_y = \frac{2\pi}{L}\,n_y , \qquad k_z = \frac{2\pi}{L}\,n_z , \tag{3.38b}$$
where $n_x$, $n_y$ and $n_z$ take on all integer values. Hence, the momentum 
$\mathbf{p} = \hbar\mathbf{k} = (2\pi\hbar/L)\,\mathbf{n}$ is quantized. The same is valid for the energy $E = \hbar\omega(k)$, 
and so for the frequency of the wave 
$$\omega(k) = \frac{E}{\hbar} = \frac{p^2}{2\hbar m} = \frac{\hbar k^2}{2m} = \frac{\hbar}{2m}\left(\frac{2\pi}{L}\right)^2\left(n_x^2+n_y^2+n_z^2\right) . \tag{3.39}$$
Inserting the values for $\mathbf{k}$ into the normalized wave function (3.37a), we get 
$$\psi_{\mathbf{k}}(\mathbf{r},t) = \frac{1}{\sqrt{V}}\exp\{i[(2\pi/L)\,\mathbf{n}\cdot\mathbf{r} - \omega(k)t]\}
 = \frac{1}{\sqrt{V}}\exp\{i[(2\pi/L)(n_xx+n_yy+n_zz) - \omega(k)t]\} . \tag{3.37b}$$
For these wave functions, we can explicitly check that the periodicity conditions 
(3.36) are fulfilled. Owing to the boundary conditions (3.36), the vector $\mathbf{k}$ (and 
therefore the momentum $\mathbf{p} = \hbar\mathbf{k}$) takes on discrete values, given by the conditions 
(3.38). In the limit $L \to \infty$, the difference between neighbouring values of 
$\mathbf{k}$ [and of the energy (3.39)] converges to zero, and finally we return to the 
motion of a free particle in infinite space. 

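Now we shall prove that these normalized wave functions $\psi_{\mathbf{k}}(\mathbf{r}, t)$ of (3.37)
constitute an orthonormal function system, so that

$$\int_V \psi_{\mathbf{k}'}^{*}\,\psi_{\mathbf{k}}\, dV = \delta_{\mathbf{k}'\mathbf{k}} \; . \qquad (3.40)$$

Here, only the spatial part of the wave functions $\psi_{\mathbf{k}}$ (plane waves) from (3.37a)
is considered. The factor containing the time dependence $\exp[\pm i\omega(\mathbf{k})t]$ does
not change anything in the orthonormality relation (3.40). We insert the wave
functions and calculate:

$$\int_{V = L^{3}} \psi_{\mathbf{k}'}^{*}(\mathbf{r})\,\psi_{\mathbf{k}}(\mathbf{r})\, dV
= \frac{1}{L^{3}} \int_{-L/2}^{L/2} e^{i(k_x - k'_x)x}\, dx \int_{-L/2}^{L/2} e^{i(k_y - k'_y)y}\, dy \int_{-L/2}^{L/2} e^{i(k_z - k'_z)z}\, dz$$

$$= \frac{1}{L^{3}} \int_{-L/2}^{L/2} \exp\left\{ i\,\frac{2\pi}{L}(n_x - n'_x)x \right\} dx \int_{-L/2}^{L/2} \exp\left\{ i\,\frac{2\pi}{L}(n_y - n'_y)y \right\} dy \int_{-L/2}^{L/2} \exp\left\{ i\,\frac{2\pi}{L}(n_z - n'_z)z \right\} dz$$

$$= \frac{\sin[\pi(n_x - n'_x)]}{\pi(n_x - n'_x)} \cdot \frac{\sin[\pi(n_y - n'_y)]}{\pi(n_y - n'_y)} \cdot \frac{\sin[\pi(n_z - n'_z)]}{\pi(n_z - n'_z)}$$

$$= \delta_{n'_x n_x}\, \delta_{n'_y n_y}\, \delta_{n'_z n_z} = \delta_{\mathbf{k}'\mathbf{k}} \; .$$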


Obviously $\sin[\pi(n_x - n'_x)] = 0$ for $n'_x \neq n_x$. Therefore, only the cases where
$n'_x = n_x$, $n'_y = n_y$ and $n'_z = n_z$ contribute. Thus, the wave functions $\psi_{\mathbf{k}}(\mathbf{r}, t)$ of
(3.37) indeed constitute an orthonormal function system. In addition, the $\psi_{\mathbf{k}}$ are
a complete system, i.e. it is impossible to find an additional function $\varphi$ that is
orthogonal to all $\psi_{\mathbf{k}}$ in the sense of the relation (3.40). Then the following
completeness relation is valid:

$$\int_V \psi\psi^{*}\, dV = \int_V |\psi|^{2}\, dV = \sum_{\mathbf{k}} |a_{\mathbf{k}}|^{2} \; , \qquad (3.41)$$

where the $a_{\mathbf{k}}$ are the expansion coefficients of the arbitrary wave function $\psi$ in
terms of a complete set of the $\psi_{\mathbf{k}}$,

$$\psi = \sum_{\mathbf{k}} a_{\mathbf{k}}\,\psi_{\mathbf{k}} \; . \qquad (3.42)$$
k 


If completeness (3.41) is proved (the proof is omitted here), we can always
expand according to (3.42), i.e. the $\psi_{\mathbf{k}}$ constitute an orthonormal basis of
a Hilbert space. A Hilbert space is a finite- or infinite-dimensional complete vector
space over the field of complex numbers. In this space a scalar product is defined such
that it assigns a complex number to each pair of functions $\psi(x)$ and $\varphi(x)$ out of
a set of linear functions. This scalar product meets four requirements:

1. $\langle\psi|\varphi\rangle = \int \psi^{*}\varphi\, dV = \left(\int \varphi^{*}\psi\, dV\right)^{*} = \langle\varphi|\psi\rangle^{*}$ ;

2. $\langle\psi|a\varphi_1 + b\varphi_2\rangle = a\langle\psi|\varphi_1\rangle + b\langle\psi|\varphi_2\rangle$ or
   $\int \psi^{*}(a\varphi_1 + b\varphi_2)\, dV = a\int \psi^{*}\varphi_1\, dV + b\int \psi^{*}\varphi_2\, dV$ (linearity) ;

3. $\langle\psi|\psi\rangle = \int \psi^{*}\psi\, dV \ge 0$ ;

4. from $\langle\psi|\psi\rangle = \int \psi^{*}\psi\, dV = 0$
   follows $\psi(x) \equiv 0$ . (3.43)


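The state vectors (= wave functions) of a quantum mechanical system constitute
a Hilbert space (hence, the Hilbert space is a function space). In the
following we show that (3.42) leads to the completeness relation (3.41). We
multiply (3.42) on both sides by its complex conjugate, integrate over the total
space and use the orthonormality condition (3.40). Then we get

$$\int_V \psi\psi^{*}\, dV = \int_V \sum_{\mathbf{k},\mathbf{k}'} a_{\mathbf{k}}\, a_{\mathbf{k}'}^{*}\,\psi_{\mathbf{k}}\,\psi_{\mathbf{k}'}^{*}\, dV = \sum_{\mathbf{k},\mathbf{k}'} a_{\mathbf{k}}\, a_{\mathbf{k}'}^{*} \int_V \psi_{\mathbf{k}}\,\psi_{\mathbf{k}'}^{*}\, dV = \sum_{\mathbf{k},\mathbf{k}'} a_{\mathbf{k}}\, a_{\mathbf{k}'}^{*}\,\delta_{\mathbf{k}\mathbf{k}'} = \sum_{\mathbf{k}} |a_{\mathbf{k}}|^{2} \; ,$$

i.e. we have the completeness relation (3.41).
To determine the coefficients $a_{\mathbf{k}}$ of the expansion (3.42) we multiply this
equation by $\psi_{\mathbf{k}'}^{*}$ and integrate over the volume $V$:

$$\int_V \psi\,\psi_{\mathbf{k}'}^{*}\, dV = \sum_{\mathbf{k}} a_{\mathbf{k}} \int_V \psi_{\mathbf{k}}\,\psi_{\mathbf{k}'}^{*}\, dV = \sum_{\mathbf{k}} a_{\mathbf{k}}\,\delta_{\mathbf{k}\mathbf{k}'} = a_{\mathbf{k}'} \; , \quad \text{i.e.} \quad a_{\mathbf{k}} = \int_V \psi\,\psi_{\mathbf{k}}^{*}\, dV \; . \qquad (3.44a)$$

With the help of the normalization integral, we obtain

$$1 = \int_V \psi\psi^{*}\, dV = \sum_{\mathbf{k},\mathbf{k}'} a_{\mathbf{k}}\, a_{\mathbf{k}'}^{*} \int_V \psi_{\mathbf{k}}\,\psi_{\mathbf{k}'}^{*}\, dV = \sum_{\mathbf{k},\mathbf{k}'} a_{\mathbf{k}}\, a_{\mathbf{k}'}^{*}\,\delta_{\mathbf{k}\mathbf{k}'} = \sum_{\mathbf{k}} a_{\mathbf{k}}\, a_{\mathbf{k}}^{*} \; .$$

Hence

$$\sum_{\mathbf{k}} |a_{\mathbf{k}}|^{2} = 1 \; . \qquad (3.44b)$$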
k 


Now we interpret the quantity $|a_{\mathbf{k}}|^{2}$ as the probability of finding a particle with
momentum

$$\mathbf{p} = \hbar\mathbf{k}$$

in the state $\psi$. This interpretation is very reasonable in view of (3.42) and the fact
that the $\psi_{\mathbf{k}}(\mathbf{r}, t)$ in (3.37) are wave functions with definite momentum $\mathbf{p} = \hbar\mathbf{k}$.
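The orthonormality relation (3.40) and the sum rule (3.44b) can be checked numerically. The following is a minimal sketch in one dimension, not part of the original text: the box length `L`, the grid size `M`, the helper names `psi_k` and `overlap`, and the test state `psi` are all illustrative assumptions.

```python
import numpy as np

# Assumed, illustrative parameters (not from the text).
L = 10.0                      # box length, arbitrary units
M = 256                       # number of grid points in one period
x = np.arange(M) * L / M      # grid covering one period [0, L)
dx = L / M

def psi_k(n):
    """Spatial part of the 1D box-normalized plane wave with k = 2*pi*n/L."""
    return np.exp(1j * 2 * np.pi * n * x / L) / np.sqrt(L)

def overlap(n1, n2):
    """Riemann-sum approximation of the integral of psi_{n1}^* psi_{n2} over the box."""
    return np.sum(np.conj(psi_k(n1)) * psi_k(n2)) * dx

# A periodic test state (hypothetical example), normalized to unity.
psi = np.exp(np.cos(2 * np.pi * x / L)) * np.exp(1j * 2 * np.pi * 3 * x / L)
psi /= np.sqrt(np.sum(np.abs(psi) ** 2) * dx)

# Expansion coefficients a_k = integral of psi * psi_k^*, cf. (3.44a).
ns = np.arange(-M // 2, M // 2)
a = np.array([np.sum(np.conj(psi_k(n)) * psi) * dx for n in ns])

print(abs(overlap(2, 2)), abs(overlap(2, 5)))   # close to 1 and 0
print(np.sum(np.abs(a) ** 2))                   # close to 1, cf. (3.44b)
```

On a grid with `M` points only `M` distinct wave numbers exist, so the sum over `ns` plays the role of the complete sum over $\mathbf{k}$ in this discretized setting.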


3.4 Mean (Expectation) Values in Quantum Mechanics 


In the following we will calculate the mean values of position, momentum and
other physical quantities in a certain state if the normalized wave function $\psi$ is
known.


1. The Mean Value of Position Coordinates. Let a quantum mechanical system
be in the state $\psi$. The position probability density is then given by the term $\psi\psi^{*}$.
The function of state $\psi$ is normalized to unity. Thus the mean value of the
position vector is given by

$$\langle\mathbf{r}\rangle = \int_V \mathbf{r}\,\psi^{*}(\mathbf{r})\,\psi(\mathbf{r})\, dV = \int_V \psi^{*}(\mathbf{r})\,\mathbf{r}\,\psi(\mathbf{r})\, dV \; .$$

Accordingly, we have for the mean (expectation) value of a function $f(\mathbf{r})$, which
depends on $\mathbf{r}$ only,

$$\langle f(\mathbf{r})\rangle = \int_V f(\mathbf{r})\,\psi^{*}(\mathbf{r})\,\psi(\mathbf{r})\, dV = \int_V \psi^{*}(\mathbf{r})\, f(\mathbf{r})\,\psi(\mathbf{r})\, dV \; .$$


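2. The Mean Value of Linear Momentum. It has already been shown that an
arbitrary wave function $\psi$ can be expanded in terms of an orthonormal basis of
the Hilbert space $\{\psi_{\mathbf{k}}\}$. Then the absolute square of the expansion coefficients
represents the momentum probability (3.44b) and the relation

$$\langle\mathbf{p}\rangle = \sum_{\mathbf{k}} a_{\mathbf{k}}^{*}\,(\hbar\mathbf{k})\, a_{\mathbf{k}}$$

holds. By insertion of expression (3.44a) for $a_{\mathbf{k}}$ into this relation for the
expectation value of the momentum, it follows that

$$\langle\mathbf{p}\rangle = \sum_{\mathbf{k}} \left(\int_{V'} \psi^{*}(\mathbf{r}')\,\psi_{\mathbf{k}}(\mathbf{r}')\, dV'\right) \hbar\mathbf{k} \left(\int_V \psi_{\mathbf{k}}^{*}(\mathbf{r})\,\psi(\mathbf{r})\, dV\right) \qquad (3.45a)$$

$$= \sum_{\mathbf{k}} \int_{V'}\!\int_V \psi^{*}(\mathbf{r}')\,\psi_{\mathbf{k}}(\mathbf{r}')\,\hbar\mathbf{k}\,\psi_{\mathbf{k}}^{*}(\mathbf{r})\,\psi(\mathbf{r})\, dV\, dV' \; . \qquad (3.45b)$$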


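It is easy to verify the relation

$$\hbar\mathbf{k}\,\psi_{\mathbf{k}}^{*}(\mathbf{r}) = i\hbar\nabla\psi_{\mathbf{k}}^{*}(\mathbf{r}) \qquad (3.46)$$

by using the wave function

$$\psi_{\mathbf{k}}^{*}(\mathbf{r}) = \frac{1}{\sqrt{V}}\, e^{-i\mathbf{k}\cdot\mathbf{r}} \; ,$$

which has already been introduced together with the corresponding boundary
condition in (3.37). Inserting (3.46) into (3.45b) yields

$$\langle\mathbf{p}\rangle = \sum_{\mathbf{k}} \int_{V'} \psi^{*}(\mathbf{r}')\,\psi_{\mathbf{k}}(\mathbf{r}')\, dV' \int_V i\hbar\left(\nabla\psi_{\mathbf{k}}^{*}(\mathbf{r})\right)\psi(\mathbf{r})\, dV \; .$$

Now, according to the condition of periodicity (3.36), the values of $\psi$ and $\psi_{\mathbf{k}}$ are
equal at opposite planes ($x = 0, L$ or $y = 0, L$ or $z = 0, L$) of the cube with
a volume $V = L^{3}$. Thus we get by partial integration of the $x$ component, for
example,

$$i\hbar \int_V \frac{\partial\psi_{\mathbf{k}}^{*}}{\partial x}\,\psi\, dx\, dy\, dz = i\hbar \int dy\, dz\, \Big[\psi_{\mathbf{k}}^{*}\,\psi\Big]_{x=0}^{x=L} - i\hbar \int_V \psi_{\mathbf{k}}^{*}\,\frac{\partial\psi}{\partial x}\, dV = \int_V \psi_{\mathbf{k}}^{*}\left(-i\hbar\,\frac{\partial\psi}{\partial x}\right) dV \; ,$$

since the boundary term vanishes because of the periodicity.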


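The expectation value of the momentum $\mathbf{p}$ is then given by

$$\langle\mathbf{p}\rangle = \int_{V'}\!\int_V \psi^{*}(\mathbf{r}')\left(-i\hbar\nabla\psi(\mathbf{r})\right) \sum_{\mathbf{k}} \psi_{\mathbf{k}}(\mathbf{r}')\,\psi_{\mathbf{k}}^{*}(\mathbf{r})\, dV\, dV' \; . \qquad (3.47)$$

Now we make use of the relation

$$\sum_{\mathbf{k}} \psi_{\mathbf{k}}(\mathbf{r}')\,\psi_{\mathbf{k}}^{*}(\mathbf{r}) = \delta(\mathbf{r}' - \mathbf{r}) \; , \qquad (3.48)$$

which can easily be proved by expanding the delta function¹ $\delta(\mathbf{r}' - \mathbf{r})$ in terms of
the complete set of functions $\psi_{\mathbf{k}}(\mathbf{r}) = V^{-1/2}\, e^{i\mathbf{k}\cdot\mathbf{r}}$,

$$\delta(\mathbf{r}' - \mathbf{r}) = \sum_{\mathbf{k}} b_{\mathbf{k}}(\mathbf{r}')\,\psi_{\mathbf{k}}(\mathbf{r}) \; , \qquad (3.49)$$

and by calculating the expansion coefficients $b_{\mathbf{k}}(\mathbf{r}')$. Multiplying both sides of
(3.49) by $\psi_{\mathbf{k}'}^{*}(\mathbf{r})$ and integrating over $\mathbf{r}$ gives

$$\int_V \delta(\mathbf{r}' - \mathbf{r})\,\psi_{\mathbf{k}'}^{*}(\mathbf{r})\, dV = \sum_{\mathbf{k}} b_{\mathbf{k}}(\mathbf{r}') \int_V \psi_{\mathbf{k}}(\mathbf{r})\,\psi_{\mathbf{k}'}^{*}(\mathbf{r})\, dV \; ,$$

so that with the help of the orthonormality relation (3.40),

$$\psi_{\mathbf{k}'}^{*}(\mathbf{r}') = \sum_{\mathbf{k}} b_{\mathbf{k}}(\mathbf{r}')\,\delta_{\mathbf{k}\mathbf{k}'} = b_{\mathbf{k}'}(\mathbf{r}') \; , \quad \text{i.e.} \quad b_{\mathbf{k}}(\mathbf{r}') = \psi_{\mathbf{k}}^{*}(\mathbf{r}')$$

results, which, in turn, immediately yields (3.48).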
Applying (3.48) and (3.49) to (3.47), we obtain the final form, whose structure
is similar to the formula for the mean value of the position vector, namely

$$\langle\mathbf{p}\rangle = \int_{V = L^{3}} \psi^{*}(\mathbf{r})\,(-i\hbar\nabla)\,\psi(\mathbf{r})\, dV \; . \qquad (3.50)$$

This relation directly expresses the mean value of the momentum with the help
of the wave function $\psi(\mathbf{r})$ for the corresponding state. The structure of (3.50)
remains valid even in the limit $L \to \infty$. Thus, in the general case of an unlimited
space, this formula is also valid for calculating the mean values of the momentum.
In a similar way we can deduce that the mean value for an arbitrary
power of the momentum can be calculated as

$$\langle\mathbf{p}^{n}\rangle = \int \psi^{*}(\mathbf{r})\,(-i\hbar\nabla)^{n}\,\psi(\mathbf{r})\, dV \; . \qquad (3.51)$$

We can immediately generalize this result for an arbitrary integral rational
function $F(\mathbf{p})$ with $F(\mathbf{p}) = \sum_{\nu} a_{\nu}\,\mathbf{p}^{\nu}$ of momentum:

$$\langle F(\mathbf{p})\rangle = \int \psi^{*}(\mathbf{r})\, F(-i\hbar\nabla)\,\psi(\mathbf{r})\, dV \; . \qquad (3.52)$$

Here, $F(-i\hbar\nabla)$ is an operator. The momentum $\mathbf{p}$ is related to the differential operator by

$$\hat{\mathbf{p}} = -i\hbar\nabla \quad \text{and} \quad \hat{F}(\hat{\mathbf{p}}) = F(-i\hbar\nabla) \; . \qquad (3.53)$$


The importance of this relation lies in the fact that if we want to calculate
the expectation value of the quantity $F(\mathbf{p})$, we need not go through a Fourier
decomposition of the wave function $\psi(\mathbf{r})$ and then calculate
$\langle F(\mathbf{p})\rangle = \sum_{\mathbf{k}} F(\hbar\mathbf{k})\, a_{\mathbf{k}}^{*} a_{\mathbf{k}}$, as was done in (3.45a) for the momentum $\mathbf{p} = \hbar\mathbf{k}$. Instead, the
entire calculation can be abbreviated by introducing the operator $\hat{F}(\hat{\mathbf{p}})$ instead
of the function $F(\mathbf{p})$, and performing the integral (3.52) directly. In the following
we shall apply these relations and calculate three operators which are of
particular importance in quantum mechanics.

¹ The definition and properties of the delta function $\delta(x)$ are discussed in Chap. 5.
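The equivalence of the two routes to the momentum expectation value can be illustrated numerically. The sketch below is an illustration under stated assumptions, not the author's procedure: it works in one dimension with $\hbar = 1$, uses a hypothetical Gaussian test state modulated by an allowed wave number `k0`, obtains the coefficients $a_k$ with NumPy's FFT, and approximates the derivative in (3.50) by a periodic central difference.

```python
import numpy as np

# Assumed, illustrative parameters (not from the text).
hbar = 1.0
L = 20.0                                   # period of the box
M = 512                                    # grid points
x = np.arange(M) * L / M
dx = L / M
k0 = 2 * np.pi * 5 / L                     # an allowed wave number, n = 5

# Hypothetical test state: Gaussian envelope times a plane wave, normalized.
psi = np.exp(-((x - L / 2) ** 2) / 2.0) * np.exp(1j * k0 * x)
psi /= np.sqrt(np.sum(np.abs(psi) ** 2) * dx)

# Route 1: Fourier decomposition, <p> = sum_k |a_k|^2 * hbar * k.
k = 2 * np.pi * np.fft.fftfreq(M, d=dx)    # allowed k values 2*pi*n/L
a = np.fft.fft(psi) * dx / np.sqrt(L)      # a_k = integral of psi * psi_k^*
p_fourier = np.sum(np.abs(a) ** 2 * hbar * k)

# Route 2: operator form, <p> = integral of psi^* (-i hbar d/dx) psi,
# with a periodic central difference standing in for d/dx.
dpsi = (np.roll(psi, -1) - np.roll(psi, 1)) / (2 * dx)
p_operator = np.real(np.sum(np.conj(psi) * (-1j * hbar) * dpsi) * dx)

print(p_fourier, p_operator, hbar * k0)
```

Both numbers should lie close to $\hbar k_0$; the operator route carries a small discretization error from the finite difference, which shrinks as the grid is refined.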


3.5 Three Quantum Mechanical Operators 


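1. The Kinetic Energy Operator. In the nonrelativistic case we have for the
kinetic energy $T = \mathbf{p}^{2}/2m$. With $\nabla^{2} = \Delta$, we obtain the operator $\hat{T}$ as

$$\hat{T} = \frac{\hat{\mathbf{p}}^{2}}{2m} = -\frac{\hbar^{2}}{2m}\,\Delta \; ; \qquad (3.54)$$

this is a special case of (3.51) or (3.52).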


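2. The Angular Momentum Operator. With the classical angular momentum of
a particle $\mathbf{L} = \mathbf{r} \times \mathbf{p}$, we obtain the quantum mechanical angular momentum
operator

$$\hat{\mathbf{L}} = \mathbf{r} \times (-i\hbar\nabla) = -i\hbar\,\mathbf{r} \times \nabla \; . \qquad (3.55a)$$

The individual components of this operator are

$$\hat{L}_x = -i\hbar\left(y\,\frac{\partial}{\partial z} - z\,\frac{\partial}{\partial y}\right) , \quad
\hat{L}_y = -i\hbar\left(z\,\frac{\partial}{\partial x} - x\,\frac{\partial}{\partial z}\right) , \quad
\hat{L}_z = -i\hbar\left(x\,\frac{\partial}{\partial y} - y\,\frac{\partial}{\partial x}\right) . \qquad (3.55b)$$

(A detailed discussion of the angular momentum operator will be given in
Chap. 4.)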


3. The Hamiltonian Operator. The total energy of time-independent physical
systems is described classically by the Hamiltonian function,

$$H = T + V(\mathbf{r}) \; .$$

Here $T$ denotes the kinetic energy and $V(\mathbf{r})$ the potential energy. This yields the
Hamiltonian operator (Hamiltonian),

$$\hat{H} = -\frac{\hbar^{2}}{2m}\,\Delta + V(\mathbf{r}) \; . \qquad (3.56)$$


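In quantum mechanics, an operator is assigned to any observable quantity (in
symbols: $A \to \hat{A}$). Let $A(\mathbf{r}, \mathbf{p})$ be a function of $\mathbf{r}$ and $\mathbf{p}$. We construct the corresponding
operator by replacing the quantities $\mathbf{r}$ and $\mathbf{p}$ in the expression for $A$ by the
assigned operators $\hat{\mathbf{r}} = \mathbf{r}$ and $\hat{\mathbf{p}} = -i\hbar\nabla$. Here the position operator is identical
with the position vector. (But we must be careful: this is not generally valid; it
is only true in the Cartesian coordinate representation chosen here! See our
discussion of quantization in curvilinear coordinates in Chap. 8.)

3.3 The Expectation Value of Kinetic Energy


Problem. Calculate the expectation value (mean value) of the kinetic energy
$T = \mathbf{p}^{2}/2m$ with $\hat{\mathbf{p}} = -i\hbar\nabla$ and of the potential $V = -e^{2}/r$ for the 1s electron in
the ground state of hydrogen with the wave function

$$\psi_{1s} = \frac{1}{\sqrt{\pi a^{3}}}\, e^{-r/a} \; , \quad a = \frac{\hbar^{2}}{m e^{2}} \; .$$

Solution. The expectation value is defined by

$$\langle T\rangle = \int \psi_{1s}^{*}(\mathbf{r})\,\hat{T}\,\psi_{1s}(\mathbf{r})\, d^{3}r \; , \quad
\langle V\rangle = \int \psi_{1s}^{*}(\mathbf{r})\, V\,\psi_{1s}(\mathbf{r})\, d^{3}r \; .$$

Using spherical coordinates we get

$$\langle T\rangle = \int \psi_{1s}^{*}\,\frac{\hat{\mathbf{p}}^{2}}{2m}\,\psi_{1s}\, d^{3}r
= \frac{4\pi}{\pi a^{3}} \int_0^{\infty} e^{-r/a}\left(-\frac{\hbar^{2}}{2m}\right) \frac{1}{r^{2}}\frac{\partial}{\partial r}\left(r^{2}\frac{\partial}{\partial r}\right) e^{-r/a}\, r^{2}\, dr$$

$$= -\frac{2\hbar^{2}}{m a^{3}} \int_0^{\infty} \left(\frac{r^{2}}{a^{2}} - \frac{2r}{a}\right) e^{-2r/a}\, dr
= -\frac{2\hbar^{2}}{m a^{3}}\left(\frac{a}{4} - \frac{a}{2}\right) = \frac{\hbar^{2}}{2 m a^{2}} = \frac{1}{2}\,\frac{m e^{4}}{\hbar^{2}} \; ,$$

$$\langle V\rangle = -\frac{4\pi e^{2}}{\pi a^{3}} \int_0^{\infty} r\, e^{-2r/a}\, dr = -\frac{4 e^{2}}{a^{3}}\cdot\frac{a^{2}}{4} = -\frac{e^{2}}{a} = -\frac{m e^{4}}{\hbar^{2}} \; .$$

The total energy is $E = \langle T + V\rangle = -\frac{1}{2}\,(m e^{4}/\hbar^{2})$; this is the binding energy of
the electron in the ground state of the hydrogen atom.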
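The radial integrals of this exercise can be cross-checked numerically. The following sketch assumes units with $\hbar = m = e = 1$ (so that $a = 1$ and energies are measured in units of $m e^{4}/\hbar^{2}$); the grid, the cutoff radius and the helper name `radial_integral` are illustrative choices, and the Laplacian of $e^{-r/a}$ is inserted in its analytic form.

```python
import numpy as np

# Assumed units: hbar = m = e = 1, hence a = 1.
hbar = m = e = 1.0
a = hbar ** 2 / (m * e ** 2)

# Radial grid; the wave function is negligible beyond r = 40 a.
r = np.linspace(1e-6, 40.0 * a, 200_001)
psi = np.exp(-r / a) / np.sqrt(np.pi * a ** 3)      # 1s wave function

# Radial Laplacian of exp(-r/a), worked out analytically:
# (1/r^2) d/dr (r^2 d/dr) e^{-r/a} = (1/a^2 - 2/(a r)) e^{-r/a}
lap_psi = (1.0 / a ** 2 - 2.0 / (a * r)) * psi

def radial_integral(f):
    """Trapezoidal integral of 4*pi*r^2*f(r) dr over the grid."""
    y = 4.0 * np.pi * r ** 2 * f
    return np.sum(0.5 * (y[1:] + y[:-1]) * np.diff(r))

T_mean = radial_integral(psi * (-hbar ** 2 / (2 * m)) * lap_psi)
V_mean = radial_integral(psi * (-e ** 2 / r) * psi)

print(T_mean, V_mean, T_mean + V_mean)   # close to 0.5, -1.0, -0.5
```

The values reproduce $\langle T\rangle = \frac{1}{2} m e^{4}/\hbar^{2}$ and $\langle V\rangle = -m e^{4}/\hbar^{2}$, so that $\langle V\rangle = -2\langle T\rangle$.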


3.6 The Superposition Principle in Quantum Mechanics 


One of the most fundamental principles of quantum mechanics is the principle
of linear superposition of states or, for short, the superposition principle. It states
that a quantum mechanical system which can take on the discrete states
$\psi_n$ ($n \in \mathbb{N}$) is also able to occupy the state

$$\psi = \sum_n a_n\,\psi_n \; . \qquad (3.57)$$

The probability density is then given by

$$\psi\psi^{*} = \sum_{n,m} a_n\, a_m^{*}\,\psi_n\,\psi_m^{*} \; .$$

These physical circumstances correspond to the mathematical fact that every
possible wave function $\psi$ can be expanded in terms of an orthogonal complete
set of functions $\psi_n$. We have made use of this fact in (3.42).

If a quantum mechanical system can be in a sequence of states $\psi_f$, characterized
by an arbitrary physical quantity $f$, the state

$$\psi = \int c_f\,\psi_f\, df \qquad (3.58)$$

is also realized. Thus the wave equation for $\psi$ must be a linear differential
equation (Chap. 6). The superposition principle can only be satisfied as follows:
if the $\psi_n$ are solutions of the linear fundamental equation, a linear combination
of type (3.57) will also be a solution because of the linearity of the equation.


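3.4 Superposition of Plane Waves, Momentum Probability

The representation of a wave field $\psi(\mathbf{r}, t)$ by superposition of de Broglie waves,

$$\psi_{\mathbf{p}}(\mathbf{r}, t) = \frac{1}{(2\pi\hbar)^{3/2}} \exp\left[\frac{i}{\hbar}\,(\mathbf{p}\cdot\mathbf{r} - Et)\right] \; , \qquad (1)$$

is an example of such a superposition. The normalization factor in (1), denoted
by $N$ in the following, results from

$$\int \psi_{\mathbf{p}'}^{*}\,\psi_{\mathbf{p}}\, d^{3}r = \lim_{g\to\infty} N^{2} \int_{-g}^{g} \exp\left[-\frac{i}{\hbar}\,(\mathbf{p}' - \mathbf{p})\cdot\mathbf{r}\right] d^{3}r$$

$$= N^{2} \lim_{g\to\infty} 2^{3}\, \frac{\sin[g(p_x - p'_x)/\hbar]}{(p_x - p'_x)/\hbar} \cdot \frac{\sin[g(p_y - p'_y)/\hbar]}{(p_y - p'_y)/\hbar} \cdot \frac{\sin[g(p_z - p'_z)/\hbar]}{(p_z - p'_z)/\hbar}$$

$$= N^{2} (2\pi)^{3}\, \delta\!\left(\frac{\mathbf{p} - \mathbf{p}'}{\hbar}\right) = N^{2} (2\pi\hbar)^{3}\, \delta(\mathbf{p} - \mathbf{p}')$$

with

$$\delta(x) = \lim_{g\to\infty} \frac{\sin(gx)}{\pi x} \; .$$

The $\delta$ functions play an important role in various mathematical treatments
encountered in quantum mechanics; they will be discussed in greater detail in
Chap. 5 (Exercises 5.1, 5.2 and 5.4).

Since we are considering the dynamics of a free particle (no discrete momenta)
we do not normalize to unity, but to the delta function, i.e.

$$\int \psi_{\mathbf{p}'}^{*}\,\psi_{\mathbf{p}}\, d^{3}r = \delta(\mathbf{p} - \mathbf{p}') \; . \qquad (2)$$

With this normalization we get

$$N = (2\pi\hbar)^{-3/2} \; .$$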


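The wave function for an arbitrary state $\psi(\mathbf{r}, t)$ can be expanded into de Broglie
waves (1) according to

$$\psi(\mathbf{r}, t) = \int_{-\infty}^{\infty} c(\mathbf{p}, t)\,\psi_{\mathbf{p}}(\mathbf{r}, t)\, d^{3}p \; , \qquad (3)$$

where $c(\mathbf{p}, t)$ are the expansion coefficients of the wave field $\psi(\mathbf{r}, t)$ in terms of the
plane waves [the expansion coefficients correspond to the amplitudes with which
the particular states, represented by de Broglie waves, are contained in the state
$\psi(\mathbf{r}, t)$; cf. (3.58)].

Now we are able to show that (3) is simply a decomposition of $\psi(\mathbf{r}, t)$ into
a threefold Fourier integral. The Fourier formula reads:

$$\psi(\mathbf{r}, t) = \frac{1}{(2\pi)^{3}} \int_{-\infty}^{\infty} \tilde{c}(\mathbf{k}, t)\, e^{+i\mathbf{k}\cdot\mathbf{r}}\, d^{3}k \; . \qquad (4)$$

$\tilde{c}(\mathbf{p}, t)$ is the Fourier transform of the wave function $\psi(\mathbf{r}, t)$. We insert $\mathbf{k} = \mathbf{p}/\hbar$
in (4):

$$\psi(\mathbf{r}, t) = \frac{1}{(2\pi)^{3}} \int_{-\infty}^{\infty} \tilde{c}(\mathbf{p}, t) \exp\left(+i\,\frac{\mathbf{p}\cdot\mathbf{r}}{\hbar}\right) \frac{d^{3}p}{\hbar^{3}} \; .$$

In a similar way we get for the Fourier transform

$$\tilde{c}(\mathbf{p}, t) = \int_{-\infty}^{\infty} \psi(\mathbf{r}, t) \exp\left(-i\,\frac{\mathbf{p}\cdot\mathbf{r}}{\hbar}\right) d^{3}r \; .$$

Comparing (3) and (4), we find

$$\tilde{c}(\mathbf{p}, t) = \sqrt{(2\pi\hbar)^{3}}\; c(\mathbf{p}, t) \exp\left(-\frac{i}{\hbar}\, Et\right) \; .$$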
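Now using (2), it can easily be proved that

$$\int_{-\infty}^{+\infty} |c(\mathbf{p}, t)|^{2}\, d^{3}p = (2\pi\hbar)^{-3} \int_{-\infty}^{+\infty} |\tilde{c}(\mathbf{p}, t)|^{2}\, d^{3}p$$

$$= (2\pi\hbar)^{-3} \int_{-\infty}^{+\infty}\!\int_{-\infty}^{+\infty}\!\int_{-\infty}^{+\infty} d^{3}p\, d^{3}r\, d^{3}r'\, \exp\left[-\frac{i}{\hbar}\,\mathbf{p}\cdot(\mathbf{r} - \mathbf{r}')\right] \psi^{*}(\mathbf{r}', t)\,\psi(\mathbf{r}, t)$$

$$= \int_{-\infty}^{+\infty}\!\int_{-\infty}^{+\infty} d^{3}r\, d^{3}r'\, \delta(\mathbf{r} - \mathbf{r}')\,\psi^{*}(\mathbf{r}', t)\,\psi(\mathbf{r}, t)$$

$$= \int_{-\infty}^{+\infty} |\psi(\mathbf{r}, t)|^{2}\, d^{3}r = 1$$

is valid. The probability of finding a momentum in the interval $p_x, p_x + dp_x$;
$p_y, p_y + dp_y$; $p_z, p_z + dp_z$ is given by the expansion coefficients $c(\mathbf{p}, t)$. We obtain
the following expression for the probability:

$$dW(\mathbf{p}, t) = |c(\mathbf{p}, t)|^{2}\, d^{3}p \; ,$$

and for the probability density in momentum space:

$$w(\mathbf{p}, t) = |c(\mathbf{p}, t)|^{2} \; .$$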


3.7 The Heisenberg Uncertainty Principle 


Among other things, the wave character of matter [i.e. that in quantum mechanics
particles are guided by the field $\psi(x, t)$] manifests itself in the fact that there is
a direct connection between position and momentum determination in microscopic
physics, namely, we are not able to measure the exact position and
momentum of a particle simultaneously. The amount of the uncertainty is given
by the Heisenberg uncertainty principle.

Let us first demonstrate the existence of the uncertainty principle. To do this,
we consider the one-dimensional wave packet (3.19a,b), illustrated
in Fig. 3.5,

$$\psi(x, t) = 2c(k_0)\, \frac{\sin[\Delta k\,(v_g t - x)]}{v_g t - x}\; e^{i(\omega_0 t - k_0 x)} \qquad (3.19c)$$

at time $t = 0$. The extension of the wave group can be characterized by the
quantity $\Delta x$, i.e. the distance of the first minimum from the maximum. The
condition for minima is

$$|\psi|^{2} = 4c^{2}(k_0)\, \frac{\sin^{2}(\Delta k\, x)}{x^{2}} = 0 \; .$$

Thus we get for the first minimum

$$\Delta k\,\Delta x = \pi \; .$$

Fig. 3.5. The probability density $|\psi(x, 0)|^{2}$ of the wave packet (3.19c) at time $t = 0$

Inserting the momentum according to de Broglie, $\Delta p = \hbar\,\Delta k$, we get as an estimate for the
Heisenberg uncertainty principle between position and momentum,

$$\Delta p\,\Delta x \approx \pi\hbar \; . \qquad (3.59)$$


This equation means that in microscopic physics position and momentum cannot
be determined simultaneously with arbitrary precision; both
quantities are always related by the above relation.

Heisenberg's uncertainty principle is a consequence of the wave character of
the particles (more exactly: of the guiding field of the particles). Using the
superposition principle, the probability field is a wave packet superposed of
waves with a definite momentum (plane waves). The particle guided by this wave
packet can be found with a high probability within $\Delta x$. It is said to be localized
in $\Delta x$. For such a localization $\Delta x$, a great number of plane waves with momenta
near $\hbar k_0$, i.e. a momentum packet of width $\hbar\,\Delta k$, is required. In classical physics,
uncertainty relations of a similar form appear in processes involving waves. The
transmission of a spatially limited electromagnetic signal by a sender occurs in
the form of a wave packet containing waves of all frequencies (momenta). To get
a wave with a single frequency, the sender must transmit for as long as possible
(indefinitely), because the process of switching on and off contributes other
frequencies. Therefore the wave spreads throughout space and no determination
of its position is possible.

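After this rather illustrative consideration, we will now derive the Heisenberg
uncertainty principle in an exact way. Our starting point is an arbitrary particle
state which is described by the wave function $\psi(x)$. Furthermore, we assume that
$\psi$ is normalized to unity, and we restrict the calculation at first to one dimension
only.

In deriving the uncertainty principle, we first have to determine a measure
for the uncertainty, i.e. we have to define a measure for the deviation of $p_x$, or $x$,
from their respective mean values

$$\bar{p}_x = \int \psi^{*}(x)\left(-i\hbar\,\frac{\partial}{\partial x}\right)\psi(x)\, dx \quad \text{and} \quad \bar{x} = \int \psi^{*}(x)\, x\,\psi(x)\, dx \; .$$

Here, we use the mean-square deviations (dispersions) $\overline{\Delta p_x^{2}}$ and $\overline{\Delta x^{2}}$, which are
defined by

$$\overline{\Delta p_x^{2}} = \overline{(p_x - \bar{p}_x)^{2}} = \overline{p_x^{2}} - \bar{p}_x^{2} \; , \quad \overline{\Delta x^{2}} = \overline{(x - \bar{x})^{2}} = \overline{x^{2}} - \bar{x}^{2} \; . \qquad (3.60)$$

For the following calculation we choose a suitable coordinate system: we
assume the origin to be fixed in the point $\bar{x}$ and let it move with the centre of the
distribution $\bar{x}$ so that at any time $\bar{x} = 0$ is valid. Then we have

$$\bar{x} = 0 \quad \text{and} \quad \bar{p}_x = 0 \; .$$

From the relation for the dispersions (mean-square deviations) (3.60) we get

$$\overline{\Delta x^{2}} = \overline{x^{2}} \quad \text{and} \quad \overline{\Delta p_x^{2}} = \overline{p_x^{2}} \; . \qquad (3.61)$$

The mean values are easily calculated, i.e.

$$\overline{x^{2}} = \int \psi^{*}\, x^{2}\,\psi\, dx \; , \quad
\overline{p_x^{2}} = \int \psi^{*}\left(-\hbar^{2}\,\frac{\partial^{2}}{\partial x^{2}}\right)\psi\, dx = -\hbar^{2} \int \psi^{*}\,\frac{\partial^{2}\psi}{\partial x^{2}}\, dx \; . \qquad (3.62)$$

To establish a connection between the quantities $\overline{x^{2}}$ and $\overline{p_x^{2}}$, we consider the
integral

$$I(\alpha) = \int_{-\infty}^{\infty} \left|\,\alpha x\,\psi(x) + \frac{d\psi(x)}{dx}\right|^{2} dx \; , \quad \alpha \in \mathbb{R} \; . \qquad (3.63)$$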


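The integrand is an absolute square. Therefore $I(\alpha)$ is always greater than, or
equal to, zero. We multiply out and get

$$I(\alpha) = \alpha^{2} \int_{-\infty}^{\infty} x^{2}\,|\psi|^{2}\, dx + \alpha \int_{-\infty}^{\infty} x\left(\frac{d\psi^{*}}{dx}\,\psi + \psi^{*}\,\frac{d\psi}{dx}\right) dx + \int_{-\infty}^{\infty} \frac{d\psi^{*}}{dx}\,\frac{d\psi}{dx}\, dx \; . \qquad (3.64)$$

It is helpful to introduce the following abbreviations:

$$A = \int_{-\infty}^{\infty} x^{2}\,|\psi|^{2}\, dx = \overline{\Delta x^{2}} \; ,$$

$$B = -\int_{-\infty}^{\infty} x\left(\frac{d\psi^{*}}{dx}\,\psi + \psi^{*}\,\frac{d\psi}{dx}\right) dx = -\int_{-\infty}^{\infty} x\,\frac{d}{dx}(\psi^{*}\psi)\, dx
= -x\,\psi^{*}\psi\,\Big|_{-\infty}^{\infty} + \int_{-\infty}^{\infty} \psi^{*}\psi\, dx = 1 \; , \qquad (3.65a)$$

because $\psi$ vanishes at the boundaries of the integration;

$$C = \int_{-\infty}^{\infty} \frac{d\psi^{*}}{dx}\,\frac{d\psi}{dx}\, dx = \psi^{*}\,\frac{d\psi}{dx}\,\Big|_{-\infty}^{\infty} - \int_{-\infty}^{\infty} \psi^{*}\,\frac{d^{2}\psi}{dx^{2}}\, dx = \frac{\overline{p_x^{2}}}{\hbar^{2}} = \frac{\overline{\Delta p_x^{2}}}{\hbar^{2}} \; . \qquad (3.65b)$$

With the abbreviations (3.65), the integral (3.64) can be written

$$I(\alpha) = A\alpha^{2} - B\alpha + C \ge 0 \; .$$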


As this polynomial of second order in $\alpha$ is non-negative according to (3.63),
the discriminant must necessarily be negative or vanish: $I(\alpha)$ must not become
negative for any real $\alpha$, so the roots of the quadratic equation $I(\alpha) = 0$ must be
complex or coincide. Thus the relation

$$B^{2} - 4AC \le 0$$

is necessarily fulfilled. Inserting in this inequality the values for $A$, $B$ and
$C$ denoted in (3.65), we obtain the uncertainty relation for momentum and
position in the form

$$\overline{\Delta x^{2}}\;\overline{\Delta p_x^{2}} \ge \frac{\hbar^{2}}{4} \; . \qquad (3.66)$$

Localization of a particle by a slit

The geometry of the diffraction at a slit. In the first diffraction minimum, a ray
from the middle of the slit differs from a ray from the edges by $\lambda/2$

The experimentally proved wave nature of particles (meaning the existence of
a guiding field) alone obviously implies that the momentum and coordinates of
a particle cannot be simultaneously determined; these observables can never be
measured simultaneously with arbitrary precision. We will see in the following
that this principle is also valid for other pairs of physical quantities, provided
that the product of their dimensions has the dimension of action (see, however,
Chap. 4, "Heisenberg's Uncertainty Relations for Arbitrary Observables").
Using a number of typical examples for simultaneous measurements of
particle momentum and position, we shall now illustrate the uncertainty principle.
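Before turning to the examples, the inequality (3.66) can be probed numerically. The following sketch is an illustration under stated assumptions ($\hbar = 1$, a finite grid, two hypothetical test functions); the helper name `uncertainty_product` is not from the text. It evaluates $\overline{\Delta x^{2}}$ directly and $\overline{\Delta p_x^{2}}$ through the integral $C$ of (3.65b), using a finite-difference derivative.

```python
import numpy as np

# Assumed, illustrative setup (not from the text).
hbar = 1.0
x = np.linspace(-12.0, 12.0, 4001)
dx = x[1] - x[0]

def uncertainty_product(psi):
    """Return (Delta x)^2 * (Delta p)^2 for a wave function sampled on the grid x."""
    psi = psi / np.sqrt(np.sum(np.abs(psi) ** 2) * dx)      # normalize to unity
    rho = np.abs(psi) ** 2
    x_mean = np.sum(x * rho) * dx
    var_x = np.sum((x - x_mean) ** 2 * rho) * dx
    dpsi = np.gradient(psi, dx)                             # finite-difference derivative
    p_mean = np.real(np.sum(np.conj(psi) * (-1j * hbar) * dpsi) * dx)
    # <p^2> = hbar^2 * integral |dpsi/dx|^2 dx, cf. the quantity C in (3.65b)
    var_p = hbar ** 2 * np.sum(np.abs(dpsi) ** 2) * dx - p_mean ** 2
    return var_x * var_p

gaussian = np.exp(-x ** 2 / 2.0)            # expected to reach the bound hbar^2 / 4
odd_state = x * np.exp(-x ** 2 / 2.0)       # a non-Gaussian test function

print(uncertainty_product(gaussian), uncertainty_product(odd_state))
```

For the Gaussian the product comes out close to $\hbar^{2}/4$, the smallest value allowed by (3.66); for the second test function it is close to $9\hbar^{2}/4$, well above the bound.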




3.5 Position Measurement with a Slit


We observe a de Broglie wave moving in the $x$ direction through a slit
perpendicular to it with a width $d = \Delta y$ (see Figure). The corresponding interference
pattern is visible on a screen standing parallel to, and behind, the slit. Since
the momentum in the $y$ direction is given by $p_y = 0$, we would expect that once
the particle has passed the slit, a simultaneous determination of its momentum
and position in the $y$ direction is possible. However, the diffraction of the wave
at the slit causes an additional component of momentum in the $y$ direction. As
the diffraction is symmetric, we normally have $\bar{p}_y = 0$. At diffraction angle $\alpha$,
corresponding to the first diffraction minimum, the path a beam from the edge
of the slit travels is longer by $\lambda/2$ than that of a beam from the middle of the slit
(see Figure below). Then the greatest
intensity is to be expected between $-\alpha$ and $+\alpha$, and we make use of this angle
as a measure of the momentum uncertainty. The relation for $\alpha$ reads:

$$\frac{d}{2}\,\sin\alpha = \frac{\lambda}{2} \; . \qquad (1)$$

The projection of the momentum on the $y$ axis is given by

$$p \sin\alpha = \Delta p_y = \frac{2\pi\hbar}{\lambda}\,\sin\alpha \; .$$

Inserting $\sin\alpha = \lambda\,\Delta p_y / 2\pi\hbar$ in (1) yields

$$\frac{d}{2}\,\frac{\lambda\,\Delta p_y}{2\pi\hbar} = \frac{\lambda}{2} \; .$$

The uncertainty principle $\Delta p_y\,\Delta y = 2\pi\hbar$ follows from this, i.e. the more precisely
the particle position $d$ is determined, the less exact the determination of its
momentum will be. In other words: the smaller the slit, the more the particle will
be diffracted.

3.6 Position Measurement by Enclosing a Particle in a Box


We will try to determine the position of a particle exactly by enclosing it in a box
and letting the side $l = \Delta x$ of the box shrink ($l = \Delta x \to 0$). The uncertainty of the
particle's momentum is given by $\Delta p \approx \hbar/l$ because standing waves fitting into
the box have a wavelength of the order of $l$ (see Figure).

It then follows for the kinetic energy that

$$E_{\text{kin}} = \frac{\Delta p^{2}}{2m} \approx \frac{\hbar^{2}}{2 m l^{2}} \; .$$

Hence, as the box becomes smaller, the kinetic energy and momentum will grow
according to the uncertainty principle. The result of this "Gedankenexperiment"
has been experimentally confirmed. Electrons in atoms have energies of
10-100 eV, their atomic diameter being $10^{-8}$-$10^{-9}$ cm, while nucleons have
energies of the order of 1 MeV, the size of a nucleus being $\sim 10^{-12}$ cm, which
confirms the uncertainty principle. Let us check the latter explicitly. To do so,
we need some numerical values: nuclear diameter $\sim 10^{-12}$ cm, nucleon mass
$m_N c^{2} \approx 938$ MeV, $\hbar c \approx 197 \times 10^{-13}$ cm MeV. From the uncertainty principle it
follows that

$$\Delta p \approx \frac{\hbar}{\Delta x} \; , \quad \Delta E = \frac{\Delta p^{2}}{2 m_N} \approx \frac{\hbar^{2}}{2 m_N (\Delta x)^{2}} = \frac{(\hbar c)^{2}}{2\, m_N c^{2}\, (\Delta x)^{2}} \; .$$

Inserting the values given above, we get for the order of magnitude of the kinetic
energy of the nucleon:

$$\Delta E \approx 0.2 \text{ MeV} \; .$$
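The arithmetic behind this estimate can be retraced in a few lines. This is only a sketch: it assumes that the nuclear diameter quoted in the text is to be read as $10^{-12}$ cm, and the variable names are illustrative.

```python
# Numerical values quoted in the text; the nuclear diameter is an assumed reading.
hbar_c = 197e-13        # hbar * c in MeV cm
m_c2 = 938.0            # nucleon rest energy m c^2 in MeV
delta_x = 1e-12         # nuclear diameter in cm (assumption)

# Delta E = (Delta p)^2 / (2 m) with Delta p ~ hbar / Delta x,
# rewritten as (hbar c)^2 / (2 m c^2 (Delta x)^2) so that only MeV and cm appear.
delta_E = hbar_c ** 2 / (2.0 * m_c2 * delta_x ** 2)

print(f"{delta_E:.2f} MeV")   # prints "0.21 MeV"
```

Because $\Delta E$ scales as $1/(\Delta x)^{2}$, halving the assumed diameter would raise the estimate by a factor of four, which is why only the order of magnitude is meaningful here.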
3.7 Position Measurement with a Microscope


We consider a beam of light perpendicular to the $x$ axis and illuminating the
object to be observed. From the theory of the microscope it is known that the
$x$ coordinate of a particle can be measured with a precision of $\Delta x = \lambda/\sin\varepsilon$,
where $\varepsilon$ is the angle illustrated in the Figure. The resolution limit, $\Delta x$, is
calculated with the aid of the following argument: for the lattice constant $\Delta x$ to
become visible, at least the first diffraction maximum must be observable
through the lens, i.e. $\Delta x \sin\varepsilon = \lambda$; from this it follows that, for given angle $\varepsilon$ and
given wavelength $\lambda$, only quantities $\Delta x \approx \lambda/\sin\varepsilon$ can be resolved.

The particle's image is produced by a photon which is scattered by the
particle and moves through the lens into the microscope. According to the
Compton effect, the momenta change in the scattering process. The particle
suffers a recoil momentum of the order of $\hbar\omega/c$. The momentum is not known
exactly because of the arbitrary direction of the photons in the cone with the
angle $2\varepsilon$. Therefore the momentum transfer to the particle has to lie in the range

$$\Delta p_x = p \sin\varepsilon = \frac{\hbar\omega}{c}\,\sin\varepsilon \; .$$

The product of position and momentum uncertainties yields

$$\Delta x\,\Delta p_x \approx \lambda\,\frac{\hbar\omega}{c} = 2\pi\hbar \; ,$$

which is again the uncertainty relation.

3.8 Momentum Measurement with a Diffraction Grating


We want to make use of the arrangement sketched above of collimator slit and
diffraction grating to determine the momentum of a matter wave. A particle
beam of width $l$ is collimated by a slit and impinges on the grating. The grating
constant of the grating is $d$. Thus the number of grating lines which are
important for the diffraction is $N = l/d$. A grating is able to separate two waves
of different wavelength if the following condition is valid (resolving power):

$$\frac{\Delta\lambda}{\lambda} = \frac{1}{N} \; .$$

This will be discussed in the next example.

For an exact determination of the position, we should fix the detector
directly at that position of the grating from which the particle is scattered. But
this does not help much because there, all waves of different wavelengths still
cover the same spatial region. The detector must be fixed at least at a distance
from the grating where the beams of different wavelength separate. Let $\alpha$ ($\alpha \ll 1$)
denote the angle both beams enclose. Then this minimal distance is given by
$\Delta s = l/\alpha$. Thus, with

$$p = \frac{h}{\lambda} \quad \text{and} \quad \Delta p = p\,\frac{\Delta\lambda}{\lambda} = \frac{p}{N} \quad \text{(momentum times the relative wavelength resolution)} \; ,$$

we get

$$\Delta p\,\Delta s = \frac{h}{\lambda}\,\frac{d}{l}\,\frac{l}{\alpha} = \frac{h\, d}{\lambda\,\alpha} \; .$$

In order to get a diffraction, $d$ and $\lambda$ must at least be of the same order of
magnitude. $\alpha$ is small; thus we have

$$\Delta p\,\Delta s > h \; .$$

A Supplementary Remark. An exact momentum measurement of a particle
could be made by scattering a monochromatic wave from this particle.
Momentum and energy conservation are valid, so the particle momenta are
determined, before and after scattering, by the measured frequencies, using
the momentum-frequency relation. But since a monochromatic (plane) wave
stretches throughout space, we do not get any information about the position.
To determine the position of the particle, we should scatter a wave packet which
is spatially restricted. On the other hand, it contains all frequency (momentum)
components, thus leading to the uncertainty relation again.

3.9 Physical Supplement: The Resolving Power of a Grating


Let us consider a grating consisting of an infinite number of slits at a distance
$d$ (see next Figure). Investigating all beams coming from corresponding positions
of the slits of the grating (for example from the left edge of each slit) and
moving in a direction defined by the angle $\beta$, we observe that the total intensity
in general vanishes. To such a beam will generally correspond a similar beam
which comes from a slit at a greater distance and has a phase difference of
exactly 180° with respect to the former one. The beams will only overlap in
a constructive way if the path difference $d(\sin\alpha - \sin\beta)$ between two
neighbouring beams is exactly a multiple of the wavelength. Thus, for an
infinitely extended grating, the maxima of intensity only occur under the angle
$\beta$ with

$$d(\sin\alpha - \sin\beta) = m\lambda \; , \quad m \in \mathbb{N}_0 \qquad (1)$$

($m$ is called the order of the maximum). Here we have neglected the interference
structure from the superposition of the beams coming through a single slit;
however, they contribute to the diffraction pattern, too. In the following, we
want to calculate the complete structure of intensity behind the grating, which
now may have a finite number of slits instead of an infinite number.

We consider a grating of slit width $a$, line distance $d$ and number of slits (line
number) $N$. Let us now calculate the amplitude in the direction $\beta$ in the case of an
incident angle $\alpha$ (see Figure). If two beams overlap with a phase difference $\varphi$, the
resulting amplitude will be proportional to the complex number $e^{i\varphi}$. Our aim is
to integrate over all phase differences, those of the rays coming through the same
slit and those of rays coming through different slits. The phase difference of two
rays is given by

$$\varphi(\xi) = 2\pi\,\xi\,\frac{\sin\alpha - \sin\beta}{\lambda} \; , \qquad (2)$$

where $\xi$ is the spatial separation of those positions at which the rays pass the
grating. Thus the amplitude $u$ is given by

$$u \sim \left(\int_0^{a} + \int_d^{d+a} + \cdots + \int_{(N-1)d}^{(N-1)d+a}\right) \exp\left[i\,2\pi\,\xi\,\frac{\sin\alpha - \sin\beta}{\lambda}\right] d\xi \; . \qquad (3)$$

We substitute

$$\gamma = \frac{\pi a}{\lambda}\,(\sin\alpha - \sin\beta) \; , \quad \delta = \frac{\pi d}{\lambda}\,(\sin\alpha - \sin\beta) \; . \qquad (4)$$

Performing the integrals yields

$$\int_{nd}^{nd+a} \exp\left[i\,\frac{2\pi}{\lambda}\,(\sin\alpha - \sin\beta)\,\xi\right] d\xi
= \frac{-i\lambda}{2\pi(\sin\alpha - \sin\beta)} \left(\exp\left[i\,\frac{2\pi}{\lambda}\,(\sin\alpha - \sin\beta)\,\xi\right]\right)\Bigg|_{nd}^{nd+a}$$

$$= \frac{-ia}{2\gamma}\left(e^{i(2n\delta + 2\gamma)} - e^{i2n\delta}\right) = \frac{-ia}{2\gamma}\; e^{i2n\delta}\left(e^{i2\gamma} - 1\right) \; .$$

Therefore the wave field behind the grating reads:

$$u \sim \frac{-ia}{2\gamma}\left[\left(e^{2i\gamma} - 1\right) + \left(e^{2i(\delta + \gamma)} - e^{2i\delta}\right) + \cdots + \left(e^{2i[(N-1)\delta + \gamma]} - e^{2i(N-1)\delta}\right)\right]$$

$$= \frac{-ia}{2\gamma}\left(e^{2i\gamma} - 1\right) \sum_{n=0}^{N-1} e^{2in\delta}
= \frac{-ia}{2\gamma}\left(e^{2i\gamma} - 1\right) \frac{e^{2iN\delta} - 1}{e^{2i\delta} - 1} \; . \qquad (5)$$

Thus the intensity is

$$I \sim u u^{*} \sim a^{2}\, \frac{\sin^{2}\gamma}{\gamma^{2}}\, \frac{\sin^{2} N\delta}{\sin^{2}\delta} \; . \qquad (6)$$

The second factor yields the principal maxima at $\delta = m\pi$, i.e. at

$$d(\sin\alpha - \sin\beta) = m\lambda \quad (m \in \mathbb{N}_0) \qquad (7)$$

[see (1)]. The first factor $(\sin^{2}\gamma)/\gamma^{2}$ provides the interference pattern of a single
slit, which is superposed on the interference pattern (see next Figure, dashed
lines).

The condition $\partial I/\partial\delta = 0$ provides further, less intense, secondary maxima,
which are separated from the principal maxima and from each other by the dark
spots at $\delta = m'(\pi/N)$ ($m' \in \mathbb{N}$) (see Figure). The greater $N$ is, the sharper the
principal maximum will be and the closer the minima of intensity will lie to
the principal maximum. Thus, $d$ determines the position of the principal maxima,
$N$ their sharpness, $a$ the intensity of the principal maxima of first, second,
third, ... order.

If, for example, $d = 2a$, we do not have any principal maxima of even order.
If we consider gratings with a complicated slit structure, whose permeability is
not a simple box function but for example sinusoidal (sinusoidal grating), the
permeability function plays the role of the slit width.


[Figure: intensity behind the grating as a function of $\delta$, with axis marks at $\pi$, $2\pi$, $3\pi$, ...]

The resolving power of a grating is defined by its ability to separate two
principal maxima. Two principal maxima (for example, belonging to different
wavelengths) lying at different positions, can still be separated if one maximum
just coincides with the dark region $\delta = \pi/N$ of the other one. Thus for both
maxima the following relation must be valid:

$$|\Delta\delta| = \frac{\pi}{N} \; . \qquad (8)$$

$\Delta\delta$ can be transformed into a wavelength difference $\Delta\lambda$: with (4) we have

$$|\Delta\lambda|\,\frac{d}{\lambda^{2}}\,|\sin\alpha - \sin\beta| = \frac{1}{N} \; , \qquad (9)$$

and with (7)

$$m\,\frac{|\Delta\lambda|}{\lambda} = \frac{1}{N} \; .$$

The resolving power is then given by

$$\frac{\lambda}{|\Delta\lambda|} = m N \; .$$

Put into words: resolution equals order of maximum times line number; the
greater the number of slits, the better the resolution of the grating. To separate,
for example, the two neighbouring yellow sodium vapour lines ($\Delta\lambda = 6$ Å at
$\lambda \approx 5890$ Å), we need a resolving power of about 1000; thus we can separate lines of
first order with a grating of 1000 lines. On the other hand, if we are satisfied with
weaker maxima of second order, 500 lines will be enough.
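The intensity formula $I \sim (\sin^{2}\gamma/\gamma^{2})(\sin^{2}N\delta/\sin^{2}\delta)$ and the rule "resolution equals order times line number" can be explored numerically. The sketch below is illustrative only: the function names `grating_intensity` and `lines_needed`, the choice $N = 10$, and the sodium wavelength of 5890 Å are assumptions made for the example, and the constant factor $a^{2}$ is dropped.

```python
import numpy as np

def grating_intensity(delta, N, a_over_d):
    """Relative intensity (sin g / g)^2 * (sin(N d) / sin d)^2 with g = (a/d) * delta."""
    gamma = a_over_d * delta
    single_slit = np.sinc(gamma / np.pi) ** 2        # np.sinc(t) = sin(pi t) / (pi t)
    sin_d = np.sin(delta)
    # At a principal maximum sin(delta) vanishes and the ratio tends to N^2.
    with np.errstate(divide="ignore", invalid="ignore"):
        ratio = np.where(np.abs(sin_d) < 1e-12,
                         float(N) ** 2,
                         (np.sin(N * delta) / sin_d) ** 2)
    return single_slit * ratio

def lines_needed(wavelength, delta_wavelength, order):
    """Number of lines N from the resolving-power relation lambda / |Delta lambda| = m N."""
    return wavelength / (order * delta_wavelength)

N = 10
print(grating_intensity(0.0, N, 0.5))          # zeroth order: N^2
print(grating_intensity(np.pi, N, 0.5))        # first order, weakened by the slit factor
print(grating_intensity(2 * np.pi, N, 0.5))    # second order vanishes for d = 2a
print(grating_intensity(np.pi / N, N, 0.5))    # first dark spot next to the maximum

print(lines_needed(5890.0, 6.0, 1), lines_needed(5890.0, 6.0, 2))
```

With $d = 2a$ (`a_over_d = 0.5`) the even orders are suppressed because $\gamma = m\pi/2$ is then a multiple of $\pi$, and the sodium example gives roughly 980 lines in first order and 490 in second order, in line with the rounded values 1000 and 500 quoted in the text.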
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3.10 Properties of a Gaussian Wave Packet 


Let a wave packet be described at time t = 0 by 


7 : 
W(x, 0) = Aexp (— 35 + iho] (1) 
(a Gaussian wave packet). 


Problem. a) Express w(x, 0) as a superposition of plane waves. 

b) What is the approximate relation between the width of the wave packet in 
configuration (x) space and its width in k space? 

c) Using the dispersion relation for de Broglie waves, calculate the function 
W(x, t) for any time ¢. 

d) Discuss |y(x, t)|*. 

e) How must the constant A be chosen according to the probability inter- 
pretation, so that w(x, t) describes the motion of a particle? 


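Solution. a) We obtain the frequency spectrum of a wave packet $\psi(x)$ by 
forming the Fourier transform $a(k)$ of the wave function: 

\[ a(k) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} \psi(x, 0) \exp(-ikx) \, dx
        = \frac{A}{\sqrt{2\pi}} \int_{-\infty}^{\infty} \exp\left( -\frac{x^2}{2a^2} + i k_0 x - i k x \right) dx . \]

This integral is solved by completing the square to give a complete error 
integral, 

\[ \int_{-\infty}^{\infty} \exp(-\xi^2) \, d\xi = \sqrt{\pi} . \]

Completing the square yields 

\[ -\frac{x^2}{2a^2} - i(k - k_0)x = -\left[ \frac{x}{\sqrt{2}\,a} + \frac{i a (k - k_0)}{\sqrt{2}} \right]^2 - \frac{a^2 (k - k_0)^2}{2} . \]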
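Now we replace the squared term in the exponent by $-\xi^2$ (so that $dx = \sqrt{2}\,a\,d\xi$) and obtain 

\[ a(k) = \frac{A}{\sqrt{2\pi}} \int_{-\infty}^{\infty} \exp(-\xi^2) \sqrt{2}\,a \, d\xi \, \exp\left( -\frac{a^2 (k - k_0)^2}{2} \right)
        = \frac{A}{\sqrt{2\pi}} \sqrt{2}\,a \exp\left( -\frac{a^2 (k - k_0)^2}{2} \right) \sqrt{\pi}
        = A a \exp\left( -\frac{a^2 (k - k_0)^2}{2} \right) . \tag{2a} \]

The coefficients $a(k)$ denote the portion of the partial wave with wave number 
$k$ in the Gaussian wave packet. As a superposition of plane waves, the Gaussian 
wave packet has the form 

\[ \psi(x, 0) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} a(k) \, e^{ikx} \, dk . \tag{2b} \]
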

b) In (1), the width of the Gaussian function is approximately $\Delta x \sim a$. The 
width of the distribution function of the plane waves in (2a) is given by the 
Gaussian function $\exp[-(k - k_0)^2 a^2/2]$: 

\[ \Delta k \sim \frac{1}{a} . \]

Thus the uncertainty principle involving both quantities is $\Delta x \, \Delta k \sim 1$. 
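c) The general form of a wave function is 

\[ \psi(x, t) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} a(k) \exp[i(kx - \omega t)] \, dk . \]

The dispersion relation for de Broglie waves reads 

\[ \omega(k) = \frac{\hbar k^2}{2m} . \]

Inserting $a(k)$ from part a) of this Exercise, we get 

\[ \psi(x, t) = \frac{A a}{\sqrt{2\pi}} \int_{-\infty}^{\infty} \exp\left[ -\frac{a^2 (k - k_0)^2}{2} + ikx - \frac{i\hbar k^2}{2m} t \right] dk . \]

Now we again complete the square, use the error integral and obtain the 
time-dependent wave function, 

\[ \psi(x, t) = \frac{A}{\sqrt{1 + i(\hbar t/ma^2)}} \exp\left[ -\frac{x^2 - 2 i a^2 k_0 x + i(\hbar t/m) k_0^2 a^2}{2a^2 [1 + i(\hbar t/ma^2)]} \right] . \]
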
d) The following holds: 

\[ |\psi(x, t)|^2 = \frac{|A|^2}{\sqrt{1 + (\hbar t/ma^2)^2}} \exp\left[ -\frac{(x - \hbar k_0 t/m)^2}{a^2 [1 + (\hbar t/ma^2)^2]} \right] . \]

The maximum of this Gaussian function is at the position $x = \hbar k_0 t/m$. The 
maximum moves with the group velocity $v = \hbar k_0/m$. But the wave packet 
“flattens”: at $t = 0$ the width of $|\psi|^2$ is just $a$, and at a later time (formally 
speaking: at an earlier time as well) its width is given by $a' = a\sqrt{1 + (\hbar t/ma^2)^2}$. 
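e) Independently of time, the normalization condition for a particle has to be 

\[ 1 = \int_{-\infty}^{\infty} |\psi(x, 0)|^2 \, dx = |A|^2 \int_{-\infty}^{\infty} \exp\left( -\frac{x^2}{a^2} \right) dx = |A|^2 a \sqrt{\pi} . \]

Thus it follows for $|A|$ 

\[ |A| = \frac{1}{(a\sqrt{\pi})^{1/2}} . \]
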

As this condition is valid only for the absolute value of A, the phase of the wave 
remains undetermined. 
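
The results of parts c) to e) can be checked numerically. The following is a minimal sketch, not part of the original solution: it assumes units with $\hbar = m = 1$, the illustrative values $a = 1$, $k_0 = 2$, and hypothetical helper names (`psi`, `packet_moments`). It evaluates the closed-form $\psi(x, t)$, integrates $|\psi|^2$ with the trapezoidal rule and compares the norm, the position of the maximum and the width with $1$, $\hbar k_0 t/m$ and $a\sqrt{1 + (\hbar t/ma^2)^2}$.

```python
import cmath
import math

HBAR = 1.0  # assumption: units with hbar = 1
MASS = 1.0  # assumption: unit particle mass


def psi(x, t, a=1.0, k0=2.0):
    """Closed-form psi(x, t) from part c), with |A| = (a * sqrt(pi)) ** -0.5."""
    amplitude = (a * math.sqrt(math.pi)) ** -0.5
    s = 1.0 + 1j * HBAR * t / (MASS * a * a)
    numerator = x * x - 2j * a * a * k0 * x + 1j * (HBAR * t / MASS) * k0 * k0 * a * a
    return amplitude / cmath.sqrt(s) * cmath.exp(-numerator / (2.0 * a * a * s))


def trapezoid(values, dx):
    """Trapezoidal rule on an equidistant grid."""
    return dx * (sum(values) - 0.5 * (values[0] + values[-1]))


def packet_moments(t, a=1.0, k0=2.0, half_range=40.0, n=8001):
    """Return (norm, mean position, width a') of |psi(x, t)|^2."""
    dx = 2.0 * half_range / (n - 1)
    xs = [-half_range + i * dx for i in range(n)]
    rho = [abs(psi(x, t, a, k0)) ** 2 for x in xs]
    norm = trapezoid(rho, dx)
    mean = trapezoid([x * r for x, r in zip(xs, rho)], dx) / norm
    var = trapezoid([(x - mean) ** 2 * r for x, r in zip(xs, rho)], dx) / norm
    # A density proportional to exp(-(x - mean)^2 / a'^2) has variance a'^2 / 2.
    return norm, mean, math.sqrt(2.0 * var)


if __name__ == "__main__":
    for t in (0.0, 1.0, 3.0):
        norm, mean, width = packet_moments(t)
        expected = math.sqrt(1.0 + t * t)  # a' for a = 1 and hbar = m = 1
        print(f"t={t}: norm={norm:.6f} mean={mean:.6f} width={width:.6f} formula={expected:.6f}")
```

The norm stays equal to one for every time, which illustrates why the normalization in part e) can be fixed at $t = 0$.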
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3.11 Normalization of Wave Functions 


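For the 1s and 2s electrons, the unnormalized wave functions for the hydrogen 
atom (to be calculated in Chap. 9) are 

\[ \psi_{1s}(r, \vartheta, \varphi) = \psi_{1s}(r) = e^{-\varrho} , \]
\[ \psi_{2s}(r, \vartheta, \varphi) = \psi_{2s}(r) = \left( 1 - \frac{\varrho}{2} \right) e^{-\varrho/2} , \]

with $\varrho = r/a$ and the Bohr radius $a = \hbar^2/me^2$. 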


Problem. a) Prove their orthogonality and normalize. 
b) Sketch $|\psi|^2$ and $4\pi r^2 |\psi|^2$ for both cases. What is the meaning of $|\psi|^2$ and 
$4\pi r^2 |\psi|^2$? 


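Solution. a) The normalization conditions for $\tilde\psi_{1s}$ and $\tilde\psi_{2s}$, which differ from 
$\psi_{1s}$ and $\psi_{2s}$ only by a constant factor, are given by 

\[ \int \tilde\psi^* \tilde\psi \, d^3r = \int |\tilde\psi_{1s}|^2 \, d^3r = \int |\tilde\psi_{2s}|^2 \, d^3r = 1 . \]

Moreover, the orthogonality 

\[ \int \psi_{1s}^* \psi_{2s} \, d^3r = 0 \]

has to be proved. To this end, we calculate the three integrals which appear by 
using spherical coordinates and the relation 

\[ \int_0^{\infty} x^{\nu - 1} e^{-\mu x} \, dx = \frac{1}{\mu^{\nu}} (\nu - 1)! \quad (\nu \in \mathbb{N}) , \]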
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obtaining 


\[ \int |\psi_{1s}|^2 \, d^3r = 4\pi \int \dots \]
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Thus the normalized wave functions are 


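\[ \tilde\psi_{1s} = \frac{1}{\sqrt{\pi a^3}} \, \psi_{1s} \quad \text{and} \quad \tilde\psi_{2s} = \frac{1}{\sqrt{8\pi a^3}} \, \psi_{2s} . \]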

b) The probability of finding an electron with the (normalized) wave function 
$\psi$ in the volume element $dV$ at the position $r$ is simply $|\psi(r)|^2 dV$. The 
normalization condition $\int |\psi|^2 dV = 1$ expresses the fact that the probability of 
finding the electron anywhere is just 1. The probability of finding the electron in 
a spherical shell with radius $r$ and thickness $dr$ is 

\[ \int_{\text{spherical shell}} |\psi|^2 \, dV = 4\pi r^2 |\psi|^2 \, dr \]

if the wave function is independent of the angles $\vartheta$ and $\varphi$ (as in our case). This 
illustrates the meaning of the second expression. 
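
The normalization factors and the orthogonality of part a) can be cross-checked numerically. The sketch below is an illustration under stated assumptions, not the book's calculation: lengths are measured in units of the Bohr radius ($a = 1$), the radial integrals are cut off at $\varrho = 60$, and the helper name `radial_integral` is hypothetical. It integrates $4\pi\varrho^2 f(\varrho)$ with the trapezoidal rule, so the expected values are $\pi$ for $\int|\psi_{1s}|^2 d^3r$, $8\pi$ for $\int|\psi_{2s}|^2 d^3r$ and $0$ for the overlap.

```python
import math


def radial_integral(f, rho_max=60.0, n=60001):
    """Approximate 4*pi * integral_0^rho_max f(rho) * rho^2 d(rho) (units of a^3)."""
    h = rho_max / (n - 1)
    total = 0.0
    for i in range(n):
        rho = i * h
        weight = 0.5 if i in (0, n - 1) else 1.0  # trapezoidal end-point weights
        total += weight * f(rho) * rho * rho
    return 4.0 * math.pi * h * total


def psi_1s(rho):
    """Unnormalized 1s wave function exp(-rho)."""
    return math.exp(-rho)


def psi_2s(rho):
    """Unnormalized 2s wave function (1 - rho/2) * exp(-rho/2)."""
    return (1.0 - rho / 2.0) * math.exp(-rho / 2.0)


norm_1s = radial_integral(lambda rho: psi_1s(rho) ** 2)
norm_2s = radial_integral(lambda rho: psi_2s(rho) ** 2)
overlap = radial_integral(lambda rho: psi_1s(rho) * psi_2s(rho))

if __name__ == "__main__":
    print("norm 1s / pi    :", norm_1s / math.pi)
    print("norm 2s / (8 pi):", norm_2s / (8.0 * math.pi))
    print("overlap         :", overlap)
```

The same function also gives the shell probability $4\pi r^2|\psi|^2 dr$ discussed in part b) when the integration range is restricted to a thin interval.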

For the 1s and 2s electrons, the functions look similar to those in the figure 
below, where we have consciously omitted a scaling of the abscissa. 

[Figure: Probability density (above) and probability density in a spherical shell (below) of the two hydrogen wave functions] 


3.12 Melons in Quantum Land 


In Quantum Land, a strange land, where $\hbar = 10^4$ erg s, melons with a very hard 
peel are grown; they have a diameter of approximately 20 cm and contain seeds 
with a mass of around 0.1 g. 


Problem. Why do we have to be careful when cutting open melons grown in 
Quantum Land? Are such melons visible? How big is the recoil of a melon at the 
reflection of a “visual” photon of 628 nm wavelength? 


Solution. From the uncertainty relation $\Delta p \, \Delta x \sim \hbar$ follows for the momentum 
uncertainty of the melon seeds $\Delta p \approx 10^4$ erg s / 10 cm $= 10^3$ g cm s$^{-1}$ and 
therefore their velocity uncertainty is $\Delta v = \Delta p/m = 100$ m s$^{-1}$. The seeds leave the 
melon with this (mean) velocity when it is cut open. 

A photon of wavelength $\lambda = 628$ nm has the momentum $p = \hbar(2\pi/\lambda) = 
10^9$ g cm s$^{-1}$ and the energy $E = pc = 3 \times 10^{19}$ erg. The mass of the melon is 
$m \approx (4\pi/3)R^3 \times 1$ g cm$^{-3} \approx 4$ kg, its rest energy is $mc^2 \approx 3.6 \times 10^{24}$ erg; hence 
we can calculate nonrelativistically. Let the collision be elastic. The momentum 
of the melon after the collision is approximately $p_M = 2p = 2 \times 10^9$ g cm s$^{-1}$. 
This corresponds to a velocity $v_M = 5$ km s$^{-1}$, which is less than the escape 
velocity from the Earth. By the time this photon is seen, the melon is already 
situated elsewhere ($\Delta p_M \Delta x_M \approx \hbar$!). The absorption of such a photon would 
probably be rather unpleasant for a human being. 
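
The order-of-magnitude estimates above are easy to reproduce. The short sketch below simply redoes the arithmetic in CGS units; the variable names are illustrative, the melon radius of 10 cm is used for $\Delta x$ as in the solution, and $c = 3 \times 10^{10}$ cm s$^{-1}$ is the rounded value implied by the quoted energies.

```python
import math

HBAR = 1.0e4          # erg s, the value postulated for Quantum Land
C_LIGHT = 3.0e10      # cm/s, rounded speed of light
SEED_MASS = 0.1       # g
MELON_RADIUS = 10.0   # cm (diameter of about 20 cm)
MELON_DENSITY = 1.0   # g/cm^3, as assumed in the solution
WAVELENGTH = 628e-7   # cm (628 nm)

# Seeds: momentum and velocity uncertainty from dp * dx ~ hbar.
seed_dp = HBAR / MELON_RADIUS   # g cm/s
seed_dv = seed_dp / SEED_MASS   # cm/s

# Photon: momentum p = hbar * 2*pi / lambda and energy E = p * c.
photon_p = HBAR * 2.0 * math.pi / WAVELENGTH   # g cm/s
photon_e = photon_p * C_LIGHT                  # erg

# Melon: mass, rest energy and recoil velocity for an elastic reflection.
melon_mass = 4.0 / 3.0 * math.pi * MELON_RADIUS ** 3 * MELON_DENSITY   # g
melon_rest_energy = melon_mass * C_LIGHT ** 2                          # erg
melon_v = 2.0 * photon_p / melon_mass                                  # cm/s

if __name__ == "__main__":
    print(f"seed velocity uncertainty: {seed_dv / 100.0:.0f} m/s")
    print(f"photon momentum: {photon_p:.3e} g cm/s, energy: {photon_e:.3e} erg")
    print(f"melon mass: {melon_mass / 1000.0:.2f} kg, rest energy: {melon_rest_energy:.3e} erg")
    print(f"melon recoil velocity: {melon_v / 1.0e5:.2f} km/s")
```

With the unrounded melon mass the recoil velocity comes out slightly below the rounded 5 km s$^{-1}$ quoted in the text, and the photon energy is several orders of magnitude smaller than the rest energy, which supports the nonrelativistic treatment.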
DE BROGLIE, Prince Louis Victor, French physicist, 1892-1987, professor of theoretical 
physics at the Institut Henri Poincaré. He founded with his Ph.D. thesis “Recherches 
sur la Théorie des Quanta” (1924) the theory of matter waves (de Broglie waves) 
and was awarded the Nobel Prize in Physics for it in 1929. Later he worked mainly on the 
development of the quantum theory of elementary particles (neutrino theory of light, 
wave theory of elementary particles) and proposed a new method for the treatment of 
wave equations with higher spin, the so-called fusion method. 


DAVISSON, Clinton Joseph, American physicist, *Bloomington (IL), 22.10.1881, 
†Charlottesville (VA), 1.2.1958. From 1917 to 1946 D. was a scientist at Bell Telephone 
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Charlottesville. In 1927 D. and L.H. Germer measured electron diffraction by crystals, 
a decisive proof of the wave nature of matter. In 1937 he was awarded the Nobel Prize in 
Berlin and, from 1946, Director of the Institut für Physikalische Chemie und 
Elektrochemie in Berlin-Dahlem. v.L. was the first director of the Institut für Theoretische 
Physik in Frankfurt (from 1914 until 1919), his successor being M. Born. His proposal to 
irradiate crystals with X-rays was performed by Walther Friedrich and Paul Knipping in 
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thermodynamics. Further treatises covered superconductivity, glow-electron emission 
and the mechanism of amplifier valves. After 1933 v.L. tried, often successfully, to oppose 
the influence of national socialism on science in Germany. 


BORN, Max, German physicist, *Breslau, Germany (now Wroclaw, Poland) 
12.12.1882, †Göttingen 5.1.1970. B. was a professor in Berlin (1915), Frankfurt (1919) and 
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Natural Philosophy in Edinburgh in 1936. From 1954 on, B. lived in retirement in Bad 
succeeded in 1925, together with his students W. Heisenberg and P. Jordan, in the 
creation of matrix mechanics. In Göttingen, B. founded an important school of theoretical 
physics. In 1926 he interpreted Schrödinger’s wave functions as probability amplitudes, 
thus introducing the statistical point of view into modern physics. For this he was 
belatedly awarded the Nobel Prize in 1954. 
14.2.1943. H., son of a lawyer, studied in Königsberg and Heidelberg and became 
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proved himself as a world-wide authority in his famous talk given in Paris in 1900, where 
he proposed 23 mathematical problems which interest mathematicians even today. H. 
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e.g. on the theory of invariants, group theory and the theory of algebraic manifolds. His 
investigations on number theory culminated in 1897 in his report, “Die Theorie der 
algebraischen Zahlkörper” and in his proof of Waring’s Problem. In the field of 
geometry he introduced strictly axiomatic concepts in “Die Grundlagen der Geometrie” 
(1899). His works on the theory of integral equations and on the calculus of variations 
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development of set theory and the problems arising in the foundations of mathematics, 
H. created his proof theory and thereby became one of the leaders of the axiomatic 
HEISENBERG, Werner Karl, German physicist, *Würzburg 5.12.1901, †München 
1.2.1976. From 1927-41 he was Professor of Theoretical Physics in Leipzig and Berlin; in 
1941, professor at, and director of, the Max-Planck-Institut für Physik in Berlin, 
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Thus the more intuitive ideas of the older Bohr-Sommerfeld quantum theory have to be 
rejected. At the same time H. provided the foundation for the new Göttingen matrix 
mechanics in his rules for multiplication of quadratic schemata, which he developed 
together with M. Born and P. Jordan in Sept. of 1925. In close collaboration with 
N. Bohr he was able to show the deeper physical or philosophical background of the 
new formalism. The Heisenberg uncertainty principle of 1927 became the basis of the 
Copenhagen interpretation of quantum theory. In 1932 H. was awarded the Nobel Prize 
in Physics “for the Creation of Quantum Mechanics”. After the discovery of the neutron 
by J. Chadwick in 1932, H. realized that this new particle together with the proton must 
be considered as constituents of atomic nuclei. On this basis he developed a theory of the 
structure of atomic nuclei and introduced, in particular, the concept of isospin. From 
1953 on, H. worked on a unifying theory of matter (often called a world formula). The aim 
of this theory is to describe all existing particles and their conversion processes using the 
conservation laws, which express the symmetry properties of the laws of nature. A nonlinear 
spinor equation is supposed to describe all elementary particles. 


JACOBI, Carl Gustav Jakob, *10.12.1804 in Potsdam as the son of a banker, 
†18.2.1851 in Berlin. J. became an instructor in Berlin after his studies in 1824 and 
from 1827 to 1842 was professor in Königsberg (Prussia). After an extended journey to Italy, 
which was to cure his impaired health, J. lived in Berlin as a university man. He became famous 
because of his work “Fundamenta nova theoriae functionum ellipticarum” (1829). In 
1832, J. found out that hyperelliptic functions can be inverted by functions of several 
variables. J. also made fundamental contributions to algebra, to elimination theory, and 
to the theory of partial differential equations, e.g. in his “Vorlesungen über Dynamik”, 
which were published in 1866. 


4. Mathematical Foundations 
of Quantum Mechanics I 


4.1 Properties of Operators 


We have already used average values of position and momentum of a particle 
and seen that we can get the average value of an observable $F$ [represented by an 
operator function $\hat F(\hat x, \hat p)$] in a state $\psi$ by 

\[ \langle F \rangle = \bar F = \int \psi^* \hat F \psi \, dV , \tag{4.1} \]

where $\hat F$ is the operator which is somehow related to $F$. In a first approach we 
are now going to deal with operators from a more general point of view. After 
this we shall determine a class of operators which is very important in quantum 
mechanics. 

Let $U$ and $W$ be two sets of functions. We define a continuous mapping $\hat L$: 
$U \to W$ with $\hat L(u) = w$ ($u \in U$; $w \in W$), and call $\hat L$ an operator. The operator 
$\hat L$ relates a function $u \in U$ to a new function $w \in W$. Symbolically we write this 
relation as a product of the operator $\hat L$ with the function $u$: 

\[ \hat L(u) = \hat L u = w . \]

An operator with the property 

\[ \hat L(\alpha_1 u_1 + \alpha_2 u_2) = \alpha_1 \hat L u_1 + \alpha_2 \hat L u_2 , \tag{4.2} \]


where $u_1$, $u_2$ are arbitrary functions and $\alpha_1$, $\alpha_2$ are arbitrary constants, is called 
a linear operator. 

We can see that the position operator $\hat x = x$ and the momentum operator 
$\hat p_x = -i\hbar \, \partial/\partial x$ are linear operators. A typical nonlinear operator is, for instance, 
the square-root operator, as $\sqrt{\alpha_1 u_1 + \alpha_2 u_2} \neq \alpha_1 \sqrt{u_1} + \alpha_2 \sqrt{u_2}$ obviously 
holds. Furthermore, a linear operator is self-adjoint or Hermitian if 

\[ \int \psi_1^* \hat L \psi_2 \, dV = \int (\hat L \psi_1)^* \psi_2 \, dV , \tag{4.3} \]

where $\psi_1$, $\psi_2$ are arbitrary square-integrable functions, whose derivatives vanish 
at the boundaries of the region of integration. 

In quantum mechanics we require that all operators be self-adjoint and 
linear; in this case, the superposition principle holds. Of course, linear operators 
do not violate the superposition principle. In order to be able to describe 
meaningful and measurable quantities with our operators, we must demand that 
their average values be real. This property is guaranteed by Hermitian (self- 
adjoint) operators. We can show this in the following expression: 
\[ \bar L = \int \psi^* \hat L \psi \, dV = \int (\hat L \psi)^* \psi \, dV = \left( \int \psi^* \hat L \psi \, dV \right)^* = \bar L^* , \tag{4.4} \]


and therefore the mean value is real. 
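
A finite-dimensional illustration may help here. In the sketch below, which is an assumption-laden toy model and not part of the text, the wave function is replaced by a two-component complex vector and the operator by a $2 \times 2$ matrix, so that $\int \psi^* \hat L \psi \, dV$ becomes a sum. The names `expectation`, `HERMITIAN` and `NON_HERMITIAN` are illustrative. The Hermitian matrix gives a real mean value for an arbitrary state, whereas the non-Hermitian one does not.

```python
def matvec(matrix, vector):
    """Apply a square matrix to a vector."""
    return [sum(row[j] * vector[j] for j in range(len(vector))) for row in matrix]


def inner(u, v):
    """Scalar product <u|v> = sum_i u_i^* v_i, the discrete analogue of the integral."""
    return sum(complex(a).conjugate() * b for a, b in zip(u, v))


def expectation(matrix, state):
    """Mean value <state|matrix|state> / <state|state>."""
    return inner(state, matvec(matrix, state)) / inner(state, state)


# A Hermitian matrix (equal to its conjugate transpose) and a non-Hermitian one.
HERMITIAN = [[2.0, 1 - 1j], [1 + 1j, -1.0]]
NON_HERMITIAN = [[2.0, 1 - 1j], [3 + 2j, -1.0]]

STATE = [0.6 + 0.2j, -0.3 + 0.7j]  # arbitrary, not normalized

if __name__ == "__main__":
    print("Hermitian     :", expectation(HERMITIAN, STATE))
    print("non-Hermitian :", expectation(NON_HERMITIAN, STATE))
```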


4.2 Combining Two Operators 


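We define the sum $\hat A + \hat B = \hat C$ of two operators, 

\[ \hat C \psi = (\hat A + \hat B)\psi = \hat A \psi + \hat B \psi \tag{4.5} \]

and their product $\hat A \hat B = \hat C$ 

\[ \hat C \psi = (\hat A \hat B)\psi = \hat A (\hat B \psi) . \tag{4.6} \]

Equation (4.6) means that $\hat B$ acts on $\psi$ first, and then $\hat A$ acts on the new function 
$(\hat B \psi)$. If $\hat A$ and $\hat B$ are Hermitian, we notice immediately that $\hat A + \hat B$ is Hermitian, 
too. The product operator $\hat C$ requires more care. 

It is important to realize that the product of two operators in general does 
not commute, i.e. $\hat A \hat B - \hat B \hat A \neq 0$. Hence the order of the operators is important: 
in general, $\hat A(\hat B \psi) \neq \hat B(\hat A \psi)$. For instance, $\hat p_x \hat x \psi \neq \hat x \hat p_x \psi$ since 
$-i\hbar \, \partial(x\psi)/\partial x \neq x(-i\hbar \, \partial\psi/\partial x)$. 

Two operators commute if, and only if, 

\[ \hat A \hat B - \hat B \hat A = 0 . \tag{4.7} \]

We call this expression a commutator and write it as 

\[ \hat A \hat B - \hat B \hat A = [\hat A, \hat B]_- . \tag{4.8} \]

In analogy, we define an anticommutator as 

\[ \hat A \hat B + \hat B \hat A = [\hat A, \hat B]_+ . \tag{4.9} \]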


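Now we are going to obtain an answer to the question: under which 
conditions is the product $\hat A \hat B$ of two Hermitian operators Hermitian, too? We 
rewrite the product $\hat A \hat B$ as 

\[ \hat A \hat B = \tfrac{1}{2}[\hat A, \hat B]_+ + \tfrac{1}{2}[\hat A, \hat B]_- , \tag{4.10} \]

and we will now show that the $\tfrac{1}{2}[\hat A, \hat B]_+$ part is always Hermitian, whereas the 
$\tfrac{1}{2}[\hat A, \hat B]_-$ part never is. Let us start with the following relation: 

\[ \tfrac{1}{2} \int \psi_1^* [\hat A, \hat B]_\pm \psi_2 \, dV
   = \tfrac{1}{2} \int \psi_1^* (\hat A \hat B \pm \hat B \hat A) \psi_2 \, dV \]
\[ = \tfrac{1}{2} \int (\hat A \psi_1)^* \hat B \psi_2 \, dV \pm \tfrac{1}{2} \int (\hat B \psi_1)^* \hat A \psi_2 \, dV \]
\[ = \tfrac{1}{2} \int (\hat B \hat A \psi_1)^* \psi_2 \, dV \pm \tfrac{1}{2} \int (\hat A \hat B \psi_1)^* \psi_2 \, dV \]
\[ = \tfrac{1}{2} \int [(\hat B \hat A \pm \hat A \hat B) \psi_1]^* \psi_2 \, dV
   = \tfrac{1}{2} \int ([\hat B, \hat A]_\pm \psi_1)^* \psi_2 \, dV . \tag{4.11} \]
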
As $\hat A \hat B + \hat B \hat A = \hat B \hat A + \hat A \hat B$, the $\tfrac{1}{2}[\hat A, \hat B]_+$ part is always Hermitian and as 
$\hat A \hat B - \hat B \hat A = -(\hat B \hat A - \hat A \hat B)$, the $\tfrac{1}{2}[\hat A, \hat B]_-$ part is only Hermitian if it vanishes. 
Hence, the product $\hat A \hat B$ of commuting Hermitian operators is Hermitian. As any 
operator commutes with itself, $\hat A^n$ is Hermitian if $\hat A$ is, and so is $\hat A^n \hat B^m$ if $\hat A$ and 
$\hat B$ are Hermitian and commuting. 
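
These statements can be illustrated with small matrices. The sketch below is not from the text: it assumes $2 \times 2$ complex matrices as stand-ins for operators and uses two of the Pauli matrices as a convenient noncommuting Hermitian pair, together with a commuting pair built from the identity and one Pauli matrix. The helper names are illustrative.

```python
def matmul(a, b):
    """Matrix product of two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]


def combine(a, b, sign):
    """Return a + sign * b element by element."""
    n = len(a)
    return [[a[i][j] + sign * b[i][j] for j in range(n)] for i in range(n)]


def dagger(a):
    """Conjugate transpose."""
    n = len(a)
    return [[complex(a[j][i]).conjugate() for j in range(n)] for i in range(n)]


def is_hermitian(a, tol=1e-12):
    d = dagger(a)
    n = len(a)
    return all(abs(a[i][j] - d[i][j]) < tol for i in range(n) for j in range(n))


def commutator(a, b):
    return combine(matmul(a, b), matmul(b, a), -1)


def anticommutator(a, b):
    return combine(matmul(a, b), matmul(b, a), +1)


# Noncommuting Hermitian pair (Pauli matrices sigma_x and sigma_y).
SIGMA_X = [[0, 1], [1, 0]]
SIGMA_Y = [[0, -1j], [1j, 0]]

# Commuting Hermitian pair: 1 - sigma_y and 2 + 3 * sigma_y.
A_COMM = [[1, 1j], [-1j, 1]]
B_COMM = [[2, -3j], [3j, 2]]

if __name__ == "__main__":
    print("sigma_x sigma_y Hermitian?   ", is_hermitian(matmul(SIGMA_X, SIGMA_Y)))
    print("anticommutator Hermitian?    ", is_hermitian(anticommutator(SIGMA_X, SIGMA_Y)))
    print("commutator Hermitian?        ", is_hermitian(commutator(SIGMA_X, SIGMA_Y)))
    print("commuting product Hermitian? ", is_hermitian(matmul(A_COMM, B_COMM)))
```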


4.3 Bra and Ket Notation 


The integral $\int \psi_1^* \psi_2 \, dV$ can be considered a scalar product of the square-integrable 
functions $\psi_1$ and $\psi_2$. Usually the following shorthand notation is 
used: 

\[ \langle \psi_1 | \psi_2 \rangle = \int \psi_1^* \psi_2 \, dV . \tag{4.12} \]

This is interpreted to be a product of two elements $\langle \psi_1 |$ and $| \psi_2 \rangle$. The element 
$\langle \psi_1 |$ is called a “bra” and $| \psi_2 \rangle$ is called a “ket”¹, together forming the “bra-ket” 
(bracket). Both are vectors (state vectors) in a linear vector space. By using this 
notation, many relations in quantum mechanics can be expressed more succinctly 
than by using integral representation. 

The state vectors are vectors of a complex linear vector space with an 
orthonormal base. Every expression in integral representation is related to an 
expression in Dirac notation. For instance, the orthonormality relation reads 

\[ \int \psi_m^* \psi_n \, dV = \langle \psi_m | \psi_n \rangle = \delta_{mn} . \tag{4.13} \]

Obviously, $| \psi \rangle^* = \langle \psi |$ holds. We can write the expectation value of an operator 
$\hat L$ as 

\[ \langle \psi | \hat L | \psi \rangle = \int \psi^* \hat L \psi \, dV , \tag{4.14} \]

and the Hermiticity of $\hat L$ is denoted by 

\[ \langle \psi_1 | \hat L \psi_2 \rangle = \langle \hat L \psi_1 | \psi_2 \rangle . \tag{4.15} \]


4.4 Eigenvalues and Eigenfunctions 


We can obtain more information about a Hermitian operator $\hat L$ and that which 
is physically related to it if, besides the known mean value $\bar L$, we can get an 
expression for the mean-square deviation $\overline{(\Delta L)^2}$. First, it is necessary to find 


¹ This notation originates from the famous physicist P.A.M. Dirac, whose contribution 
to relativistic quantum mechanics we will learn about in a forthcoming volume of 
this series. 
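a quantum-mechanical operator describing $(\Delta L)^2$. This is straightforward; 
in fact, we obtain the deviation from the mean value by 

\[ \Delta \hat L = \hat L - \bar L \tag{4.16} \]

and hence the square of the deviation as 

\[ (\Delta \hat L)^2 = (\hat L - \bar L)^2 . \tag{4.17} \]

The mean-square deviation can be expressed by 

\[ \overline{(\Delta L)^2} = \int \psi^* (\hat L - \bar L)^2 \psi \, dV , \tag{4.18} \]

and it must be non-negative. Indeed, from 

\[ \overline{(\Delta L)^2} = \int \psi^* (\Delta \hat L)(\Delta \hat L) \psi \, dV \tag{4.19} \]

and the Hermiticity of $\Delta \hat L$, it follows that 

\[ \overline{(\Delta L)^2} = \int_{-\infty}^{\infty} (\Delta \hat L \psi)^* (\Delta \hat L \psi) \, dV = \int_{-\infty}^{\infty} |\Delta \hat L \psi|^2 \, dV \geq 0 . \tag{4.20} \]
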


As the integrand is a nonnegative function, the integral is nonnegative and 
hence $\overline{(\Delta L)^2}$ is nonnegative, too. 

Now we search for those states $\psi_L$ for which the quantity $L$ has a constant 
value, i.e. for which the deviation $\Delta L$ of $L$ vanishes. For states of this kind, 
$\overline{(\Delta L)^2} = 0$ holds, and we obtain 

\[ \int |\Delta \hat L \psi_L|^2 \, dV = 0 . \tag{4.21} \]

The integrand is a real function which cannot be negative (as it is the absolute 
square of a complex function). Hence 

\[ \Delta \hat L \psi_L = 0 . \tag{4.22} \]

We can write this relation using the definition of $\Delta \hat L$ as 

\[ (\hat L - \bar L) \psi_L = 0 , \tag{4.23} \]

and since we may put $\bar L = L$ in the state $\psi_L$, 

\[ \hat L \psi_L = L \psi_L \tag{4.24} \]

holds. 


An equation of this kind is called an eigenvalue equation. We call $\psi_L$ an 
eigenfunction and $L$ an eigenvalue of the operator $\hat L$. In general, an operator 
$\hat L$ has several eigenfunctions $\psi_n$, with eigenvalues $L_n$. The eigenvalues $L_n$ can 
form a discrete spectrum $L_1, L_2, L_3, \dots$ or a continuous spectrum. In the latter 
case, the eigenvalues $L$ will take on any value within an interval $L_n \leq L \leq L_{n+1}$. 
We soon will encounter operators with discrete, continuous and mixed spectra 
(see Fig. 4.1). 

Now we are going to examine some general properties of eigenfunctions. 
For this, let us investigate the eigenfunctions of Hermitian operators only 
and restrict ourselves to the case of the discrete spectrum. We can show that 
eigenfunctions belonging to two different eigenvalues are orthogonal. Let $\psi_n$ and 
$\psi_m$ be eigenfunctions with the eigenvalues $L_n$ and $L_m$, respectively, i.e. 

\[ \hat L \psi_n = L_n \psi_n , \quad \hat L \psi_m = L_m \psi_m . \tag{4.25} \]


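We take the complex conjugate of the first equation and find, as the eigenvalues 
are real, 

\[ \hat L^* \psi_n^* = L_n \psi_n^* . \tag{4.26} \]

Now multiplying the second equation in (4.25) by $\psi_n^*$ and the complex conjugate 
of the first equation by $\psi_m$ yields 

\[ \psi_n^* \hat L \psi_m = L_m \psi_n^* \psi_m , \quad \psi_m \hat L^* \psi_n^* = L_n \psi_m \psi_n^* . \tag{4.27} \]

The difference between these two equations is 

\[ \psi_n^* \hat L \psi_m - \psi_m \hat L^* \psi_n^* = (L_m - L_n) \psi_n^* \psi_m . \tag{4.28} \]

If we integrate over the entire volume, we obtain 

\[ \int \psi_n^* \hat L \psi_m \, dV - \int \psi_m \hat L^* \psi_n^* \, dV = (L_m - L_n) \int \psi_n^* \psi_m \, dV . \tag{4.29} \]

As $\hat L$ is a Hermitian operator, both integrals on the l.h.s. are equal, and therefore 

\[ 0 = (L_m - L_n) \int \psi_n^* \psi_m \, dV . \tag{4.30} \]

We required that $L_m \neq L_n$; hence 

\[ 0 = \int \psi_n^* \psi_m \, dV , \tag{4.31} \]

which is the desired result and proves that $\psi_n$ and $\psi_m$ are orthogonal. 

As the eigenfunctions of a discrete spectrum are square-integrable, they can 
be normalized to unity: 

\[ \int \psi_n^* \psi_n \, dV = 1 . \tag{4.32} \]

Then we can combine relations (4.31) and (4.32) as 

\[ \int \psi_n^* \psi_m \, dV = \delta_{nm} . \tag{4.33} \]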


Hence the system of eigenfunctions is an orthonormal function system. 
Fig. 4.1. (a) General spectrum; (b) totally discrete spectrum; (c) totally continuous spectrum; (d) spectrum with continuous bands, as occur, for example, in the energy spectrum of a crystal lattice 
In general, there are several eigenfunctions for one eigenvalue $L_n$; we 
call them degenerate states. To be more precise, if $a$ different eigenfunctions 
$\psi_{n1}, \dots, \psi_{na}$ belong to the eigenvalue $L_n$, we speak of an $a$-fold degeneracy. 
Physically, this degeneracy describes the possibility that a certain value of the 
observable $L$ can be realized in different states. 

We have proved that eigenfunctions of a discrete spectrum with different 
eigenvalues are mutually orthogonal. If we have degeneracy, the functions $\psi_{nk}$ 
are related to the same eigenvalue $L_n$: $\hat L \psi_{nk} = L_n \psi_{nk}$, with $k = 1, \dots, a$; hence, they 
are in general not orthogonal. But there is always the possibility of finding 
orthogonal functions in this case, too, as we shall now show. 

Let us assume that the eigenfunctions $\psi_{nk}$ ($k = 1, \dots, a$), related to the eigenvalue 
$L_n$, are linearly independent, i.e. if $\sum_{k=1}^{a} a_k \psi_{nk} = 0$, then $a_k = 0$ holds for all 
$k$. If we could not infer $a_k = 0$ for all $k$, we would be able to express at least one 
function by a linear combination of the others, and the number of eigenfunctions 
would be smaller than $a$. If the set of $\psi_{nk}$ is orthogonal, we can use it to describe 
a certain state. If it is not orthogonal, we transform this set into a new set, i.e. 

\[ \varphi_{n\alpha} = \sum_{k=1}^{a} a_{\alpha k} \psi_{nk} , \quad \alpha = 1, \dots, a . \tag{4.34} \]

k=1 


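This transformation is linear; thus the functions $\varphi_{n\alpha}$ are also eigenfunctions of 
the operator $\hat L$ with the eigenvalue $L_n$. We now require orthogonality of the new 
functions $\varphi_{n\alpha}$: 

\[ \int \varphi_{n\alpha}^* \varphi_{n\beta} \, dV = \delta_{\alpha\beta} . \]

The conditions that have to be fulfilled by the coefficients $a_{\alpha k}$ in order to describe 
a transformation to an orthogonal function system are 

\[ \sum_{k=1}^{a} \sum_{k'=1}^{a} a_{\alpha k}^* a_{\beta k'} s_{kk'} = \delta_{\alpha\beta} \quad \text{with} \quad s_{kk'} = \int \psi_{nk}^* \psi_{nk'} \, dV . \tag{4.35} \]
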


The coefficients $a_{\alpha k}$ are determined by analogy with geometry. We consider the 
functions $\psi_{nk}$ as vectors in an $a$-dimensional function space and $s_{kk'}$ as scalar 
products of these vectors. Then we can regard transformation (4.34) as a basis 
transformation from an oblique-angled to an orthogonal coordinate system. 

Hence applying this procedure to the case of a degenerate spectrum, we can 
obtain an orthonormal set of eigenfunctions. A practical method is E. Schmidt’s 
orthogonalization method, familiar from geometry (vector calculus). In the first 
step we take one vector (state), for instance $\psi_{n1}$, and define the normalized wave 
function $\varphi_{n1} = \psi_{n1}/\sqrt{\langle \psi_{n1} | \psi_{n1} \rangle}$. 

In the next step we construct a vector $\varphi_{n2} = \alpha \varphi_{n1} + \beta \psi_{n2}$ and require 
$\langle \varphi_{n1} | \varphi_{n2} \rangle = \alpha \langle \varphi_{n1} | \varphi_{n1} \rangle + \beta \langle \varphi_{n1} | \psi_{n2} \rangle = 0$. It follows that $\alpha/\beta = -\langle \varphi_{n1} | \psi_{n2} \rangle$. 
Apart from this condition, normalization is required, i.e. $\langle \varphi_{n2} | \varphi_{n2} \rangle = 1$. From 
these two conditions follow $\alpha$ and $\beta$. The third step is the construction of 
$\varphi_{n3} = \alpha' \varphi_{n1} + \beta' \varphi_{n2} + \gamma' \psi_{n3}$. Again, orthogonality of this vector (state) to $\varphi_{n1}$ 
and $\varphi_{n2}$ and normalization are required. Hence there are three conditions to 
determine $\alpha'$, $\beta'$, $\gamma'$, etc. 


The next steps are now straightforward. We note that this transformation is 
defined only up to an orthogonal transformation. If the functions $\psi_{nk}$ are already 
orthogonal, then $s_{kk'} = \delta_{kk'}$ and 

\[ \sum_{k=1}^{a} a_{\alpha k}^* a_{\beta k} = \delta_{\alpha\beta} \tag{4.36} \]

holds. This is the condition for an orthogonal transformation. 
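
The Schmidt procedure described above can be sketched in a few lines. In this illustration the degenerate eigenfunctions are represented by finite complex vectors and the scalar product $\langle u | v \rangle$ by a sum instead of an integral; both are assumptions made for the example, and the function names are hypothetical. Each step subtracts from $\psi_{nk}$ its components along the already constructed $\varphi_{n\alpha}$ and then normalizes, which corresponds to the conditions on $\alpha, \beta, \dots$ stated in the text.

```python
import math


def inner(u, v):
    """Scalar product <u|v> = sum_i u_i^* v_i (discrete stand-in for the integral)."""
    return sum(complex(a).conjugate() * b for a, b in zip(u, v))


def schmidt_orthonormalize(vectors, tol=1e-12):
    """Return an orthonormal set spanning the same space as `vectors`."""
    basis = []
    for psi in vectors:
        phi = [complex(c) for c in psi]
        for e in basis:
            overlap = inner(e, psi)  # component of psi along e
            phi = [p - overlap * q for p, q in zip(phi, e)]
        norm = math.sqrt(inner(phi, phi).real)
        if norm < tol:
            raise ValueError("the input vectors are linearly dependent")
        basis.append([p / norm for p in phi])
    return basis


if __name__ == "__main__":
    # Three linearly independent but non-orthogonal "degenerate states".
    states = [[1, 1, 0], [1, 0, 1j], [0, 1, 1]]
    new_states = schmidt_orthonormalize(states)
    for alpha, u in enumerate(new_states):
        print([round(abs(inner(u, v)), 12) for v in new_states])
```

The printed rows form the unit matrix, i.e. the new functions satisfy $\langle \varphi_{n\alpha} | \varphi_{n\beta} \rangle = \delta_{\alpha\beta}$.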


In the case of continuous spectra we cannot enumerate eigenvalues and 
eigenfunctions. Instead we parametrize the eigenfunctions and take the eigenvalues 
as parameters. Then the equation 

\[ \hat L \psi_n(x) = L_n \psi_n(x) \tag{4.37} \]

becomes 

\[ \hat L \psi(x, L) = L \psi(x, L) , \tag{4.38} \]

if $x$ denotes all coordinates appearing in the wave function $\psi$ (for instance 
$x = x, y, z$). From the wave functions which are not orthogonal we can define 
Weyl’s eigendifferentials: 

\[ \Delta \psi(x, L) = \int_{L}^{L + \Delta L} \psi(x, L') \, dL' . \tag{4.39} \]

They divide up the continuous spectrum of the eigenvalues $L$ into discrete 
regions of size $\Delta L$ (see Fig. 4.2). The eigendifferentials are orthogonal and can be 
normalized. (See the mathematical addendum in the next chapter.) 




4.1 Hermiticity of the Momentum Operator 


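We show that the momentum operator $\hat p_x = -i\hbar \, \partial/\partial x$ is Hermitian: 

\[ \int \psi_1^* \hat p_x \psi_2 \, dV = \int \psi_1^* \left( -i\hbar \frac{\partial}{\partial x} \right) \psi_2 \, dV
   = -i\hbar \int \psi_1^* \frac{\partial \psi_2}{\partial x} \, dV
   = -i\hbar \left[ \psi_1^* \psi_2 \right]_{-\infty}^{\infty} + i\hbar \int \psi_2 \frac{\partial \psi_1^*}{\partial x} \, dV . \tag{1} \]

As $\psi_1$ and $\psi_2$ are square-integrable functions, 

\[ \left[ \psi_1^* \psi_2 \right]_{-\infty}^{\infty} = 0 \]

holds, and we obtain 

\[ \int \psi_1^* \hat p_x \psi_2 \, dV = i\hbar \int \frac{\partial \psi_1^*}{\partial x} \psi_2 \, dV = \int (\hat p_x \psi_1)^* \psi_2 \, dV . \tag{2} \]

This proves that $\hat p_x$ obeys the Hermiticity relation (4.4). 

Fig. 4.2. Chopping of the continuous spectrum by integration of the function $\psi(x, L)$ over intervals $\Delta L$ leads to H. Weyl’s eigendifferentials 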




4.2 The Commutator of Position- and Momentum Operators 


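We compute the commutator $[\hat p_x, \hat x]_-$. Since 

\[ \hat p_x \hat x \psi = -i\hbar \frac{\partial}{\partial x}(x\psi) = -i\hbar \left( \psi + x \frac{\partial \psi}{\partial x} \right) \quad \text{and} \quad \hat x \hat p_x \psi = -i\hbar \left( x \frac{\partial \psi}{\partial x} \right) , \]

the difference is $(\hat p_x \hat x - \hat x \hat p_x)\psi = -i\hbar \psi$ for every $\psi$, i.e. $[\hat p_x, \hat x]_- = -i\hbar$. 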


EXERCISE 


4.3 Computation Rules for Commutators 


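Problem. Let $\hat L, \hat L_1, \hat L_2, \hat L_3, \hat M$: $H \to H$ be linear operators in a complex linear 
space and $a$ be a scalar. Let $E$ denote the identity operator. Show (with the help 
of the definition of a commutator) the following identities: 

\[ [\hat L, \hat M]_- = -[\hat M, \hat L]_- \tag{1} \]
\[ [\hat L, \hat L]_- = 0 \tag{2} \]
\[ [\hat L, a\hat M]_- = a[\hat L, \hat M]_- \tag{3} \]
\[ [\hat L, aE]_- = 0 \tag{4} \]
\[ [\hat L_1 + \hat L_2, \hat M]_- = [\hat L_1, \hat M]_- + [\hat L_2, \hat M]_- \tag{5} \]
\[ [\hat L_1 \hat L_2, \hat M]_- = [\hat L_1, \hat M]_- \hat L_2 + \hat L_1 [\hat L_2, \hat M]_- \tag{6} \]
\[ [\hat M, \hat L_1 \hat L_2]_- = [\hat M, \hat L_1]_- \hat L_2 + \hat L_1 [\hat M, \hat L_2]_- \tag{7} \]
\[ [\hat L_1, [\hat L_2, \hat L_3]_-]_- + [\hat L_2, [\hat L_3, \hat L_1]_-]_- + [\hat L_3, [\hat L_1, \hat L_2]_-]_- = 0 \tag{8} \]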
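Solution. The first five relations are trivial (they follow directly from the definition 
$[\hat L, \hat M] = \hat L \hat M - \hat M \hat L$). The proof of the other relations is also simple, but 
it is important to pay attention to the order of the factors: 

\[ [\hat L_1 \hat L_2, \hat M] = \hat L_1 \hat L_2 \hat M - \hat M \hat L_1 \hat L_2 
   = \hat L_1 \hat L_2 \hat M - \hat L_1 \hat M \hat L_2 + \hat L_1 \hat M \hat L_2 - \hat M \hat L_1 \hat L_2 
   = \hat L_1 [\hat L_2, \hat M] + [\hat L_1, \hat M] \hat L_2 , \tag{6} \]

\[ [\hat M, \hat L_1 \hat L_2] = -[\hat L_1 \hat L_2, \hat M] = -[\hat L_1, \hat M] \hat L_2 - \hat L_1 [\hat L_2, \hat M] 
   = [\hat M, \hat L_1] \hat L_2 + \hat L_1 [\hat M, \hat L_2] , \tag{7} \]

\[ [\hat L_1, [\hat L_2, \hat L_3]] = [\hat L_1, \hat L_2 \hat L_3] - [\hat L_1, \hat L_3 \hat L_2] 
   = [\hat L_1, \hat L_2] \hat L_3 + \hat L_2 [\hat L_1, \hat L_3] - [\hat L_1, \hat L_3] \hat L_2 - \hat L_3 [\hat L_1, \hat L_2] 
   = -[\hat L_3, [\hat L_1, \hat L_2]] - [\hat L_2, [\hat L_3, \hat L_1]] . \tag{8} \]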


The last equation is also called the Jacobi identity. 
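
The identities (6), (7) and the Jacobi identity (8) can be spot-checked with concrete matrices. The sketch below is only an illustration: it assumes $2 \times 2$ integer matrices as stand-ins for the linear operators (so that all comparisons are exact) and uses hypothetical helper names. A numerical check of this kind does not replace the proof, but it catches mistakes in the order of the factors.

```python
def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]


def add(a, b):
    n = len(a)
    return [[a[i][j] + b[i][j] for j in range(n)] for i in range(n)]


def sub(a, b):
    n = len(a)
    return [[a[i][j] - b[i][j] for j in range(n)] for i in range(n)]


def comm(a, b):
    """Commutator [a, b] = ab - ba."""
    return sub(matmul(a, b), matmul(b, a))


# Arbitrary integer test matrices (illustrative values only).
L1 = [[1, 2], [3, 4]]
L2 = [[0, 1], [-1, 2]]
L3 = [[2, 0], [1, -1]]
M = [[1, -1], [2, 0]]
ZERO = [[0, 0], [0, 0]]

# Identity (6): [L1 L2, M] = [L1, M] L2 + L1 [L2, M]
identity_6 = comm(matmul(L1, L2), M) == add(matmul(comm(L1, M), L2), matmul(L1, comm(L2, M)))

# Identity (7): [M, L1 L2] = [M, L1] L2 + L1 [M, L2]
identity_7 = comm(M, matmul(L1, L2)) == add(matmul(comm(M, L1), L2), matmul(L1, comm(M, L2)))

# Identity (8), Jacobi: the cyclic sum of nested commutators vanishes.
jacobi_sum = add(add(comm(L1, comm(L2, L3)), comm(L2, comm(L3, L1))), comm(L3, comm(L1, L2)))

if __name__ == "__main__":
    print("identity (6):", identity_6)
    print("identity (7):", identity_7)
    print("Jacobi sum  :", jacobi_sum)
```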


EXAMPLE 


4.4 Momentum Eigenfunctions 


The equation for the eigenvalues of the momentum operator is 

\[ \hat p_x \psi_{p_x}(x) = p_x \psi_{p_x}(x) \quad \text{or} \quad -i\hbar \frac{\partial}{\partial x} \psi_{p_x}(x) = p_x \psi_{p_x}(x) \quad \text{or} \quad \frac{d\psi_{p_x}(x)}{dx} = \frac{i p_x}{\hbar} \psi_{p_x}(x) . \]

We infer for every $p_x$ within $-\infty < p_x < \infty$: 

\[ \psi_{p_x}(x) = C \exp\left( \frac{i}{\hbar} p_x x \right) . \]

$C$ is a (at first arbitrary) constant; we will calculate its value in Example 5.1. The 
spectrum of momentum is continuous: there is an eigenfunction for every 
momentum $p_x$; we recognize this eigenfunction as a part of the well-known 
de Broglie wave [see Eqs. (3.3) and (3.7)]. 
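
As a quick plausibility check, the eigenvalue equation can be tested numerically by replacing the derivative with a central difference. The sketch below assumes $\hbar = 1$, $C = 1$ and arbitrary sample values for $p_x$ and $x$; the function names are illustrative.

```python
import cmath

HBAR = 1.0  # assumption: units with hbar = 1


def psi_p(x, p, c=1.0):
    """Momentum eigenfunction C * exp(i p x / hbar)."""
    return c * cmath.exp(1j * p * x / HBAR)


def apply_momentum(f, x, h=1.0e-5):
    """Approximate (-i hbar d/dx) f at the point x by a central difference."""
    return -1j * HBAR * (f(x + h) - f(x - h)) / (2.0 * h)


if __name__ == "__main__":
    p, x = 1.7, 0.4  # arbitrary sample values
    lhs = apply_momentum(lambda y: psi_p(y, p), x)
    rhs = p * psi_p(x, p)
    print("p_x psi :", lhs)
    print("p * psi :", rhs)
```

Both printed numbers agree up to the discretization error of the difference quotient, for any real value of $p_x$, which reflects the continuous spectrum.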
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4.5 Measurability of Different Observables at Equal Times 


We know from Heisenberg’s uncertainty principle that it is impossible to 
measure the coordinates and momentum of a particle simultaneously and 
exactly [see (3.59 ff.)]. The value of an observable is unambiguously defined if the 
wave function is an eigenfunction of its related operator, i.e. 

\[ \hat L \psi = L \psi . \tag{4.40} \]

Then, in state $\psi$, the observable $L$ is well-defined, i.e. it has precisely the value 
$L$ and its mean square deviation $\overline{(\Delta L)^2}$ is zero. In general, $\psi$ is not an eigenfunction 
of another operator $\hat M$. Hence, we cannot infer information about the 
observable $M$ from the wave function $\psi$. Only if $\psi$ is an eigenfunction of $\hat M$, too, 
can we measure both $M$ and $L$ sharply, i.e. 

\[ \hat L \psi = L \psi \quad \text{and} \quad \hat M \psi = M \psi . \tag{4.41} \]

As both equations hold, we obtain $[\hat L, \hat M]_- \psi = 0$, because $\hat M \hat L \psi = \hat M L \psi = 
LM\psi$ and $\hat L \hat M \psi = \hat L M \psi = ML\psi$. By subtraction we get $(\hat M \hat L - \hat L \hat M)\psi = 0$. 

We have thus found that two observables are measurable at equal times if 
their commutator, acting on a common eigenfunction, vanishes. In the other 
direction we obtain the following result: if $[\hat L, \hat M]_- = 0$, then for every $\psi$, 
$\hat L \hat M \psi = \hat M \hat L \psi$. If $\psi$ is an eigenfunction of $\hat L$, we obtain $\hat L(\hat M \psi) = L(\hat M \psi)$ and 
$\psi' = \hat M \psi$ is an eigenfunction of $\hat L$, too. If $L$ is not degenerate, we can infer 
$\hat M \psi = M \psi$; i.e. $\psi' = \hat M \psi$ is a multiple of $\psi$ (here, $M\psi$). 

In the case of degeneracy, $\psi' = \hat M \psi$ can be a linear combination of $f$ degenerate 
eigenfunctions $\psi_k$ ($k = 1, 2, \dots, f$) of the eigenvalue $L$. Then we have 

\[ \hat M \psi_k = \sum_{k'=1}^{f} M_{k'k} \psi_{k'} , \quad k = 1, \dots, f . \tag{4.42} \]

Thus we cannot repeat the conclusion used above. But as the choice of the 
original wave function is arbitrary (we recall that $\hat L \hat M \psi = \hat M \hat L \psi$ must hold for 
all possible $\psi$), we can use a linear combination, 

\[ \varphi = \sum_{k'=1}^{f} a_{k'} \psi_{k'} , \tag{4.43} \]

as the initial wave function, instead of $\psi_k$. Of course 

\[ \hat L \varphi = L \varphi \tag{4.44} \]

holds, too. Now we choose the coefficients $a_k$ in order to obtain 

\[ \hat M \varphi = M \varphi . \tag{4.45} \]

We can do this because inserting $\varphi$ into this equation gives 

\[ \sum_{k'=1}^{f} a_{k'} \hat M \psi_{k'} = M \sum_{k'=1}^{f} a_{k'} \psi_{k'} . \tag{4.46} \]

After multiplication by the bra vector $\langle k |$ (corresponding to the operation 
$\int \psi_k^* \dots dx$) and using the orthogonality condition $\langle \psi_k | \psi_{k'} \rangle = \delta_{kk'}$ we obtain 

\[ \sum_{k'=1}^{f} \langle k | \hat M | k' \rangle a_{k'} = M a_k . \tag{4.47} \]

Let us abbreviate the matrix elements $\langle k | \hat M | k' \rangle$ by $M_{kk'} = \langle k | \hat M | k' \rangle$. Since we 
obtained a linear homogeneous system of equations for the $a_k$, its coefficient 
determinant must vanish, i.e. 

\[ \begin{vmatrix}
M_{11} - M & M_{12} & \cdots & M_{1f} \\
M_{21} & M_{22} - M & \cdots & M_{2f} \\
\vdots & \vdots & & \vdots \\
M_{f1} & M_{f2} & \cdots & M_{ff} - M
\end{vmatrix} = 0 . \tag{4.48} \]

The solution of this equation gives the eigenvalues $M$. We thus see that in the 
case of degeneracy of the eigenfunction $\psi_k$ of $\hat L$, we can also construct the wave 
functions $\varphi = \sum_k a_k \psi_k$ which are simultaneously eigenfunctions of $\hat L$ and $\hat M$. 


4.6 Position and Momentum Operators 


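If we start with a wave function $\psi = \psi(r)$, the position operator is the space 
vector itself: 

\[ \hat r = r . \tag{4.49} \]

Its components are 

\[ \hat x = x , \quad \hat y = y , \quad \hat z = z . \tag{4.49a} \]

The operator of momentum is expressed as 

\[ \hat p = -i\hbar \nabla , \tag{4.50} \]

and its components are 

\[ \hat p_x = -i\hbar \frac{\partial}{\partial x} , \quad \hat p_y = -i\hbar \frac{\partial}{\partial y} , \quad \hat p_z = -i\hbar \frac{\partial}{\partial z} . \tag{4.50a} \]

The commutators are 

\[ [\hat p_x, \hat x]_- = [\hat p_y, \hat y]_- = [\hat p_z, \hat z]_- = -i\hbar , \]
\[ [\hat x, \hat p_y]_- = [\hat x, \hat p_z]_- = [\hat y, \hat p_x]_- = [\hat y, \hat p_z]_- = [\hat z, \hat p_x]_- = [\hat z, \hat p_y]_- = 0 . \tag{4.51} \]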
Hence there is an uncertainty relation between the coordinates and their 
canonical conjugate momenta ($x$ and $p_x$, $y$ and $p_y$, ...). They cannot be exactly 
measured simultaneously. (See the following section, where this will be discussed 
in detail.) On the other hand, e.g. the $\hat x$ operator and the $\hat p_y$ operator do 
commute. Hence these two observables can be measured simultaneously as 
accurately as desired. Their common eigenstates are 

\[ \psi_{x_0, p_y}(x, y) = \delta(x - x_0) \exp\left( \frac{i}{\hbar} p_y y \right) . \tag{4.52} \]

For the definition of the $\delta(x)$-function we refer to Chap. 5. 


4.7 Heisenberg’s Uncertainty Relations 
for Arbitrary Observables 


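We are now in a position to consider the uncertainty relations in a more general 
way. Let two physical quantities be described by Hermitian operators $\hat A$ and 
$\hat B$ [e.g. $\hat A = \hat x$ is the position operator and $\hat B = \hat p_x = -i\hbar(\partial/\partial x)$ is the momentum 
operator]. The commutator of the two operators is written as 

\[ [\hat A, \hat B]_- = \hat A \hat B - \hat B \hat A = i\hat C , \tag{4.53} \]

where $\hat C$ is called the remainder of commutation (commutation rest). $\hat C$ can be zero: 
then $\hat A$ and $\hat B$ commute. In general $\hat C$ is a Hermitian operator, because we know 
from above that 

\[ \int \psi_1^* [\hat A, \hat B]_- \psi_2 \, dx = \int \psi_1^* (\hat A \hat B - \hat B \hat A) \psi_2 \, dx 
   = \int [(\hat B^* \hat A^* - \hat A^* \hat B^*) \psi_1^*] \psi_2 \, dx 
   = -\int [(\hat A \hat B - \hat B \hat A) \psi_1]^* \psi_2 \, dx . \tag{4.54} \]

Hence 

\[ \int \psi_1^* i\hat C \psi_2 \, dx = -\int (i\hat C \psi_1)^* \psi_2 \, dx \quad \text{or} \quad \int \psi_1^* \hat C \psi_2 \, dx = \int (\hat C \psi_1)^* \psi_2 \, dx . \tag{4.55} \]
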

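The physical quantities corresponding to the operators $\hat A$ and $\hat B$ in an arbitrary 
state have the mean values 

\[ \bar A = \int \psi^* \hat A \psi \, dx \quad \text{and} \quad \bar B = \int \psi^* \hat B \psi \, dx . \tag{4.56} \]

As before in (3.60) and (4.16), we introduce operators for the deviation from the 
mean value, 

\[ \Delta \hat A = \hat A - \bar A \quad \text{and} \quad \Delta \hat B = \hat B - \bar B , \tag{4.57} \]

and recognize that $\Delta \hat A$ and $\Delta \hat B$ obey the same commutation relations as $\hat A$ and $\hat B$, 
namely: 

\[ [\Delta \hat A, \Delta \hat B]_- = i\hat C . \]

In analogy to our considerations about the uncertainty relation of $\hat p_x$ and $\hat x$ [see 
(3.63)], we examine the integral 

\[ I(\alpha) = \int |(\alpha \Delta \hat A - i \Delta \hat B) \psi|^2 \, dx \geq 0 , \tag{4.58} \]

which depends on a real parameter $\alpha$. As $\Delta \hat A$ and $\Delta \hat B$ are Hermitian, we can write 

\[ I(\alpha) = \int [(\alpha \Delta \hat A - i \Delta \hat B) \psi]^* (\alpha \Delta \hat A - i \Delta \hat B) \psi \, dx \]
\[ = \int \psi^* (\alpha \Delta \hat A + i \Delta \hat B)(\alpha \Delta \hat A - i \Delta \hat B) \psi \, dx \]
\[ = \int \psi^* [\alpha^2 (\Delta \hat A)^2 + i\alpha(\Delta \hat B \Delta \hat A - \Delta \hat A \Delta \hat B) + (\Delta \hat B)^2] \psi \, dx \]
\[ = \int \psi^* [\alpha^2 (\Delta \hat A)^2 + \alpha \hat C + (\Delta \hat B)^2] \psi \, dx \geq 0 . \tag{4.59} \]

Now we denote by $\langle \psi | (\Delta \hat A)^2 | \psi \rangle = \overline{(\Delta A)^2}$, $\langle \psi | \hat C | \psi \rangle = \bar C$, $\langle \psi | (\Delta \hat B)^2 | \psi \rangle = \overline{(\Delta B)^2}$ the mean 
values of the squares of deviation, or of the commutation rest $\hat C$. We can 
therefore write the last equation as 

\[ \overline{(\Delta A)^2} \left[ \alpha + \frac{\bar C}{2 \overline{(\Delta A)^2}} \right]^2 + \overline{(\Delta B)^2} - \frac{\bar C^2}{4 \overline{(\Delta A)^2}} \geq 0 . \tag{4.60} \]

As this holds for every real $\alpha$, we have 

\[ \overline{(\Delta B)^2} - \frac{\bar C^2}{4 \overline{(\Delta A)^2}} \geq 0 \quad \text{or} \quad \overline{(\Delta A)^2} \, \overline{(\Delta B)^2} \geq \frac{\bar C^2}{4} . \tag{4.61} \]
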

This is Heisenberg’s uncertainty principle in its most general form. It
holds for all physical quantities with noncommuting operators. For commuting
operators ($\hat C = 0$) there is no uncertainty relation for the corresponding
physical quantities: they can be exactly measured simultaneously. From (4.51)
above, we know that $[\hat p_x, \hat x]_- = -i\hbar$. Hence the uncertainty relation for these
quantities is $\overline{(\Delta p_x)^2}\;\overline{(\Delta x)^2} \ge \hbar^2/4$, which coincides with the
result we got earlier [see (3.66)].
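
The general relation (4.61) can be spot-checked numerically in a finite-dimensional setting. The following is only a sketch under stated assumptions: operators are replaced by random 3x3 Hermitian matrices, states by normalized complex vectors, and the integrals by finite sums; all function names are hypothetical helpers introduced for this illustration.

```python
import random

def matvec(M, v):
    return [sum(M[i][j] * v[j] for j in range(len(v))) for i in range(len(M))]

def matmul(A, B):
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def expect(M, psi):
    """Mean value <psi|M|psi> for a normalized state psi."""
    return sum(p.conjugate() * q for p, q in zip(psi, matvec(M, psi)))

def random_hermitian(n, rng):
    H = [[0j] * n for _ in range(n)]
    for i in range(n):
        H[i][i] = complex(rng.uniform(-1, 1), 0.0)
        for j in range(i + 1, n):
            z = complex(rng.uniform(-1, 1), rng.uniform(-1, 1))
            H[i][j], H[j][i] = z, z.conjugate()
    return H

def random_state(n, rng):
    v = [complex(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(n)]
    norm = sum(abs(c) ** 2 for c in v) ** 0.5
    return [c / norm for c in v]

def uncertainty_sides(A, B, psi):
    """Return (var_A * var_B, mean_C**2 / 4), where i C = A B - B A as in (4.53)."""
    n = len(A)
    AB, BA = matmul(A, B), matmul(B, A)
    C = [[-1j * (AB[i][j] - BA[i][j]) for j in range(n)] for i in range(n)]
    a, b = expect(A, psi).real, expect(B, psi).real
    var_a = expect(matmul(A, A), psi).real - a * a
    var_b = expect(matmul(B, B), psi).real - b * b
    c = expect(C, psi).real
    return var_a * var_b, c * c / 4

rng = random.Random(0)  # fixed seed so the run is reproducible
results = [uncertainty_sides(random_hermitian(3, rng),
                             random_hermitian(3, rng),
                             random_state(3, rng))
           for _ in range(200)]
# Count cases in which the left side of (4.61) falls below the right side.
violations = sum(1 for lhs, rhs in results if lhs < rhs - 1e-12)
print("violations of (4.61):", violations)
```

A finite matrix model cannot reproduce $[\hat p_x, \hat x]_- = -i\hbar$ exactly, so this only illustrates the inequality itself, not the position-momentum case.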

In Chap. 6 we will prove that the energy operator is $\hat E = +i\hbar(\partial/\partial t)$, and that
the commutation relation

$$[\hat E, t]_- = i\hbar \tag{4.62}$$

holds. Hence there also exists an uncertainty relation between energy and
time:

$$\overline{(\Delta E)^2}\;\overline{(\Delta t)^2} \ge \frac{\hbar^2}{4} . \tag{4.63}$$


We will obtain similar results for the angular-momentum operators in the next 
section. 


4.8 Angular-Momentum Operators 


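We want to derive an operator for angular momentum. For this we insert the
operators $\hat{\boldsymbol r}$ and $\hat{\boldsymbol p}$ into the classical definition of the angular momentum
$\boldsymbol L = \boldsymbol r \times \boldsymbol p$, and obtain the operator equation

$$\hat{\boldsymbol L} = \hat{\boldsymbol r}\times\hat{\boldsymbol p} = -i\hbar(\boldsymbol r\times\nabla) .$$

Expressing the cross-product in Cartesian coordinates yields with (4.50):

$$\begin{aligned}
\hat L_x &= \hat y\hat p_z - \hat z\hat p_y = -i\hbar\left(y\frac{\partial}{\partial z} - z\frac{\partial}{\partial y}\right) , \\
\hat L_y &= \hat z\hat p_x - \hat x\hat p_z = -i\hbar\left(z\frac{\partial}{\partial x} - x\frac{\partial}{\partial z}\right) , \\
\hat L_z &= \hat x\hat p_y - \hat y\hat p_x = -i\hbar\left(x\frac{\partial}{\partial y} - y\frac{\partial}{\partial x}\right) .
\end{aligned} \tag{4.64}$$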


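As the factors of the various products are all commuting Hermitian operators,
$\hat{\boldsymbol L}$ is also Hermitian [see (4.10) and (4.11)]. By straightforward calculation, we
obtain the commutation relations of the angular-momentum components,

$$[\hat L_x, \hat L_y]_- = i\hbar\hat L_z , \quad [\hat L_y, \hat L_z]_- = i\hbar\hat L_x , \quad [\hat L_z, \hat L_x]_- = i\hbar\hat L_y , \tag{4.65}$$

which are frequently written in shorthand notation as

$$\hat{\boldsymbol L}\times\hat{\boldsymbol L} = i\hbar\hat{\boldsymbol L} \quad\text{or also}\quad [\hat L_i, \hat L_j]_- = i\hbar\,\varepsilon_{ijk}\hat L_k . \tag{4.66}$$

Here, $\varepsilon_{ijk}$ is the totally antisymmetric tensor in three dimensions, i.e.

$$\varepsilon_{ijk} = \begin{cases} +1 & \text{if } i,j,k \text{ is an even permutation of } 1,2,3 \\ -1 & \text{if } i,j,k \text{ is an odd permutation of } 1,2,3 \\ \phantom{+}0 & \text{if two or more indices are equal.} \end{cases}$$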


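By way of example, we test the first relation (4.65) and get

$$\begin{aligned}
\hat L_x\hat L_y - \hat L_y\hat L_x &= (y\hat p_z - z\hat p_y)(z\hat p_x - x\hat p_z) - (z\hat p_x - x\hat p_z)(y\hat p_z - z\hat p_y) \\
&= y(\hat p_z z)\hat p_x + yz\hat p_z\hat p_x - yx\hat p_z\hat p_z - z^2\hat p_y\hat p_x + zx\hat p_y\hat p_z \\
&\quad - zy\hat p_x\hat p_z + z^2\hat p_x\hat p_y + xy\hat p_z\hat p_z - x(\hat p_z z)\hat p_y - xz\hat p_z\hat p_y \\
&= -i\hbar y\hat p_x + i\hbar x\hat p_y = i\hbar(x\hat p_y - y\hat p_x) = i\hbar\hat L_z .
\end{aligned} \tag{4.67}$$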
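Here $(\hat p_z z) = -i\hbar$ denotes the momentum operator acting on $z$ only; all other
terms cancel in pairs. Thus the components of
angular momentum are not measurable at the same time, because the relations
(4.65) are of the structure of (4.53) with a commutation rest. The square of the
angular-momentum operator is

$$\hat L^2 = \hat L_x^2 + \hat L_y^2 + \hat L_z^2 .$$

It commutes with all components of the angular-momentum operator, i.e.

$$[\hat L^2, \hat L_x]_- = [\hat L^2, \hat L_y]_- = [\hat L^2, \hat L_z]_- = 0 . \tag{4.68}$$

By way of example, we compute the first commutator and get

$$\begin{aligned}
[\hat L^2, \hat L_x]_- &= [\hat L_x^2 + \hat L_y^2 + \hat L_z^2, \hat L_x]_- = [\hat L_y^2 + \hat L_z^2, \hat L_x]_- \\
&= (\hat L_y^2\hat L_x - \hat L_x\hat L_y^2) + (\hat L_z^2\hat L_x - \hat L_x\hat L_z^2) .
\end{aligned} \tag{4.69}$$

Using (4.65), the first term becomes

$$\begin{aligned}
\hat L_y^2\hat L_x - \hat L_x\hat L_y^2 &= -i\hbar\hat L_y\hat L_z + (\hat L_y\hat L_x)\hat L_y - \hat L_x\hat L_y^2 \\
&= -i\hbar\hat L_y\hat L_z + (-i\hbar\hat L_z + \hat L_x\hat L_y)\hat L_y - \hat L_x\hat L_y^2 \\
&= -i\hbar(\hat L_y\hat L_z + \hat L_z\hat L_y) ,
\end{aligned} \tag{4.70}$$

and similarly, we get for the second term

$$\begin{aligned}
\hat L_z^2\hat L_x - \hat L_x\hat L_z^2 &= i\hbar\hat L_z\hat L_y + (\hat L_z\hat L_x)\hat L_z - \hat L_x\hat L_z^2 \\
&= i\hbar\hat L_z\hat L_y + (i\hbar\hat L_y + \hat L_x\hat L_z)\hat L_z - \hat L_x\hat L_z^2 \\
&= i\hbar(\hat L_z\hat L_y + \hat L_y\hat L_z) .
\end{aligned} \tag{4.71}$$

The sum of both terms, (4.70) and (4.71), is zero. Hence, $[\hat L^2, \hat L_x]_- = 0$. Similarly
one proves the second and third relation (4.68).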
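
The relations (4.65) and (4.68) can also be checked mechanically by letting the differential operators (4.64) act on a test function. The sketch below is an illustration, not part of the original derivation: it assumes units with $\hbar = 1$, restricts the test functions to polynomials in $x, y, z$ stored as dictionaries, and uses hypothetical helper names throughout.

```python
from collections import defaultdict

HBAR = 1.0  # assumption: units with hbar = 1

def clean(p):
    """Drop vanishing coefficients of a polynomial {(i, j, k): coeff}."""
    return {k: v for k, v in p.items() if abs(v) > 1e-12}

def add(p, q, s=1.0):
    """Return p + s*q."""
    out = defaultdict(complex, p)
    for k, v in q.items():
        out[k] += s * v
    return clean(out)

def scale(p, c):
    return clean({k: c * v for k, v in p.items()})

def times(p, axis):
    """Multiply by the coordinate x (axis 0), y (axis 1) or z (axis 2)."""
    out = {}
    for k, v in p.items():
        e = list(k)
        e[axis] += 1
        out[tuple(e)] = v
    return out

def diff(p, axis):
    """Partial derivative with respect to x, y or z."""
    out = {}
    for k, v in p.items():
        if k[axis] > 0:
            e = list(k)
            e[axis] -= 1
            out[tuple(e)] = v * k[axis]
    return out

def L(axis):
    """Component of L = -i hbar (r x grad) as in (4.64); axis 0, 1, 2 = x, y, z."""
    a, b = (axis + 1) % 3, (axis + 2) % 3
    def op(p):
        return scale(add(times(diff(p, b), a), times(diff(p, a), b), -1.0),
                     -1j * HBAR)
    return op

Lx, Ly, Lz = L(0), L(1), L(2)

def L2(p):
    return add(add(Lx(Lx(p)), Ly(Ly(p))), Lz(Lz(p)))

def comm(A, B, p):
    """Commutator [A, B] applied to p."""
    return add(A(B(p)), B(A(p)), -1.0)

# Arbitrary test polynomial x^2 y - 2 y z^3 + 0.5 x y z.
f = {(2, 1, 0): 1.0, (0, 1, 3): -2.0, (1, 1, 1): 0.5}
residual_465 = add(comm(Lx, Ly, f), scale(Lz(f), 1j * HBAR), -1.0)  # [Lx, Ly] f - i hbar Lz f
residual_468 = comm(L2, Lx, f)                                      # [L^2, Lx] f
print(residual_465, residual_468)
```

Both residuals come out as empty dictionaries, i.e. the zero polynomial, for this test function; other polynomials can be substituted for `f`.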

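It is convenient to write the angular momentum in spherical coordinates. By
the transformation

$$x = r\sin\vartheta\cos\varphi , \quad y = r\sin\vartheta\sin\varphi , \quad z = r\cos\vartheta , \tag{4.72}$$

we obtain for the Cartesian coordinates of the angular-momentum operator:

$$\begin{aligned}
\hat L_x &= i\hbar\left(\sin\varphi\frac{\partial}{\partial\vartheta} + \cot\vartheta\cos\varphi\frac{\partial}{\partial\varphi}\right) , \\
\hat L_y &= i\hbar\left(-\cos\varphi\frac{\partial}{\partial\vartheta} + \cot\vartheta\sin\varphi\frac{\partial}{\partial\varphi}\right) , \\
\hat L_z &= -i\hbar\frac{\partial}{\partial\varphi} .
\end{aligned} \tag{4.73}$$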
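The equations

$$r = \sqrt{x^2 + y^2 + z^2} , \quad \cos\vartheta = \frac{z}{r} , \quad \tan\varphi = \frac{y}{x}$$

hold, and hence,

$$\begin{aligned}
\hat L_z &= -i\hbar\left(x\frac{\partial}{\partial y} - y\frac{\partial}{\partial x}\right) \\
&= -i\hbar\left[r\sin\vartheta\cos\varphi\left(\frac{\partial r}{\partial y}\frac{\partial}{\partial r} + \frac{\partial\vartheta}{\partial y}\frac{\partial}{\partial\vartheta} + \frac{\partial\varphi}{\partial y}\frac{\partial}{\partial\varphi}\right) - r\sin\vartheta\sin\varphi\left(\frac{\partial r}{\partial x}\frac{\partial}{\partial r} + \frac{\partial\vartheta}{\partial x}\frac{\partial}{\partial\vartheta} + \frac{\partial\varphi}{\partial x}\frac{\partial}{\partial\varphi}\right)\right] \\
&= -i\hbar\, r\sin\vartheta\left\{\cos\varphi\left[\sin\vartheta\sin\varphi\frac{\partial}{\partial r} + \frac{\cos\vartheta\sin\varphi}{r}\frac{\partial}{\partial\vartheta} + \frac{\cos\varphi}{r\sin\vartheta}\frac{\partial}{\partial\varphi}\right] - \sin\varphi\left[\sin\vartheta\cos\varphi\frac{\partial}{\partial r} + \frac{\cos\vartheta\cos\varphi}{r}\frac{\partial}{\partial\vartheta} - \frac{\sin\varphi}{r\sin\vartheta}\frac{\partial}{\partial\varphi}\right]\right\} \\
&= -i\hbar\frac{\partial}{\partial\varphi}
\end{aligned} \tag{4.74}$$

and

$$\begin{aligned}
\hat L^2 &= \hat L_x^2 + \hat L_y^2 + \hat L_z^2 \\
&= -\hbar^2\left\{(\sin^2\varphi + \cos^2\varphi)\frac{\partial^2}{\partial\vartheta^2} + \cot\vartheta\frac{\partial}{\partial\vartheta} + \cot^2\vartheta\frac{\partial^2}{\partial\varphi^2} + \frac{\partial^2}{\partial\varphi^2}\right\} \\
&= -\hbar^2\left\{\frac{1}{\sin\vartheta}\frac{\partial}{\partial\vartheta}\left(\sin\vartheta\frac{\partial}{\partial\vartheta}\right) + \frac{1}{\sin^2\vartheta}\frac{\partial^2}{\partial\varphi^2}\right\} = -\hbar^2\Delta_{\vartheta,\varphi} ,
\end{aligned} \tag{4.75}$$

where we denote by $\Delta_{\vartheta,\varphi}$ that part of the Laplacian acting on the variables $\vartheta$ and
$\varphi$ only. In this context we also write down the eigenfunctions of $\hat L^2$:

$$\hat L^2 Y_{lm}(\vartheta,\varphi) = L^2 Y_{lm}(\vartheta,\varphi) . \tag{4.76a}$$

These are the spherical harmonics, as will be proved in two different ways in
Examples 4.8 and 4.9 below. We are familiar with spherical harmonics from
electrodynamics; they are related to the associated Legendre polynomials by

$$Y_{lm}(\vartheta,\varphi) = \sqrt{\frac{(l-m)!\,(2l+1)}{(l+m)!\,4\pi}}\;P_l^m(\cos\vartheta)\,e^{im\varphi} . \tag{4.77}$$

The associated Legendre polynomials are

$$P_l^m(x) = \frac{(-)^m}{2^l l!}(1-x^2)^{m/2}\frac{d^{l+m}}{dx^{l+m}}(x^2-1)^l . \tag{4.78}$$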
In the eigenvalue equation (4.76a), the quantity $L^2$ can be expressed in terms
of $l$ by

$$L^2 = \hbar^2 l(l+1) , \quad l = 0, 1, 2, \ldots \tag{4.79}$$

so that (4.76a) becomes

$$\hat L^2 Y_{lm}(\vartheta,\varphi) = \hbar^2 l(l+1)\,Y_{lm}(\vartheta,\varphi) . \tag{4.76b}$$

By our choice of coordinate system, the z component of the angular momentum
was given preference, as the $Y_{lm}$ are also eigenfunctions of $\hat L_z$:

$$\hat L_z Y_{lm} = m\hbar\,Y_{lm} , \quad m = -l, -l+1, \ldots, l . \tag{4.80}$$

This can immediately be confirmed by (4.73), (4.77) and (4.78). Obviously the spectrum
of $\hat L^2$ and $\hat L_z$ is always discrete. Because $\hat L^2$ and $\hat L_z$ commute (4.68), they can be
measured simultaneously. The simultaneous eigenfunctions are the $Y_{lm}(\vartheta,\varphi)$.
Every eigenvalue $\hbar^2 l(l+1)$ of $\hat L^2$ is $(2l+1)$-fold degenerate because to every
$l$ there are $2l+1$ eigenfunctions $Y_{lm}$ $(l \ge m \ge -l)$.


Fig. 4.3. The quantum numbers $m\hbar$ characterize the quantization of the
z-component of angular momentum. One sometimes speaks in this context of
“quantization of direction”


Indeed we can infer from (4.77) and (4.80) the z projection of an angular
momentum $\boldsymbol L$ with absolute value $\hbar\sqrt{l(l+1)}$. It takes $2l+1$ different values $m\hbar$.
This is illustrated in Fig. 4.3. The angle between the angular momentum and the
direction of quantization (e.g. defined by a weak magnetic field) can have only
certain values:

$$\cos\vartheta = \frac{m}{\sqrt{l(l+1)}} . \tag{4.81}$$

This is sometimes called quantization of direction and means nothing more than
the quantization of the z component of angular momentum, i.e. $\hat L_z$. The results
thus obtained can be interpreted in a pictorial way (see Fig. 4.4); the
angular-momentum vector $\boldsymbol L$ precesses on a cone around the direction of quantization
(z axis). As a result, the x and y components of angular momentum are not
constant in time. This illustrates the uncertainty relations between $\hat L_x$ and
$\hat L_z$ and between $\hat L_y$ and $\hat L_z$ [see (4.65)].
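
As a small numerical illustration of (4.81), the allowed angles between the angular momentum and the z axis can be tabulated for a given $l$. The function name `allowed_angles` is a hypothetical helper, and the sketch assumes $l \ge 1$ (for $l = 0$ the angle is undefined because the angular momentum vanishes).

```python
import math

def allowed_angles(l):
    """Angles in degrees from cos(theta) = m / sqrt(l(l+1)), for l >= 1."""
    norm = math.sqrt(l * (l + 1))
    return {m: math.degrees(math.acos(m / norm)) for m in range(-l, l + 1)}

angles = allowed_angles(2)
for m, theta in sorted(angles.items()):
    print(f"m = {m:+d}: theta = {theta:6.2f} deg")
```

Since $|m| \le l < \sqrt{l(l+1)}$, none of the angles is $0°$ or $180°$: the vector never points exactly along the axis of quantization, in line with the cone picture of Fig. 4.4.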


4.9 Kinetic Energy 


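We obtain the operator of kinetic energy in Cartesian coordinates by analogy
from $T = p^2/2m$ as

$$\hat T = \frac{\hat p^2}{2m} = -\frac{\hbar^2}{2m}\left(\frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2}\right) = -\frac{\hbar^2}{2m}\Delta . \tag{4.82}$$

It reads in polar coordinates:

$$\hat T = -\frac{\hbar^2}{2m}\left[\frac{1}{r^2}\frac{\partial}{\partial r}\left(r^2\frac{\partial}{\partial r}\right) + \frac{1}{r^2}\Delta_{\vartheta,\varphi}\right] = -\frac{\hbar^2}{2m}\frac{1}{r^2}\frac{\partial}{\partial r}\left(r^2\frac{\partial}{\partial r}\right) - \frac{\hbar^2}{2mr^2}\Delta_{\vartheta,\varphi} = \hat T_r + \frac{\hat L^2}{2mr^2} . \tag{4.82a}$$

Here $\hat T_r$ can be interpreted as the operator of kinetic energy for a motion along
the radial direction and $\hat L^2/2mr^2$ as the operator of kinetic energy for the
rotational motion. From the above relation it follows immediately that
$[\hat T, \hat L^2]_- = 0$. Therefore the kinetic energy and the square of angular
momentum can be measured simultaneously.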


4.10 Total Energy 


Corresponding to the Hamiltonian of classical mechanics, we define the
Hamiltonian operator as the operator of total energy:

$$\hat H = \hat T + \hat V = \frac{\hat p^2}{2m} + V(\boldsymbol r) . \tag{4.83}$$

If we take for granted that the potential energy is only a function of distance,
i.e. $V = V(r)$ (central potential), we have $[\hat H, \hat L^2]_- = 0$; the square of angular
momentum and the total energy can be measured simultaneously. Equally,
$[\hat H, \hat L_z]_- = 0$.

Since $\hat T = \hat p^2/2m$ and $V = V(r)$ do not commute, no statement is possible
concerning the exact values of potential and kinetic energy, even if we know the
total energy. Solely for the mean values of these quantities do we have the
so-called virial theorem $\langle T\rangle = \frac12\langle\boldsymbol r\cdot\nabla V\rangle$, the proof of which we defer for
the moment (see Example 8.2).

Fig. 4.4. A “sharp” angular momentum $\boldsymbol L$, its z component and the components
$L_x$ and $L_y$, which are not sharp. The vector $\boldsymbol L$ precesses in an eigenstate of $\hat L^2$
and $\hat L_z$ around the z axis on a cone


EXERCISE


4.5 Proof of an Operator Inequality 


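Problem. Let $\hat A$ and $\hat B$ be Hermitian operators and $\hat C = -i[\hat A, \hat B]_- =
-i(\hat A\hat B - \hat B\hat A)$, $\hat D = \{\hat A, \hat B\} = \hat A\hat B + \hat B\hat A$. Prove the following relation for the
expectation values:

$$\overline{A^2}\;\overline{B^2} \ge \tfrac14\left[(\bar C)^2 + (\bar D)^2\right] .$$

Solution. Let $\varphi(x, t)$ be an arbitrary state; $\lambda \in \mathbb{C}$, $\lambda = \alpha + i\beta$, a complex number.
We define

$$\begin{aligned}
0 \le I(\lambda) &= \int |(\hat A + i\lambda\hat B)\varphi|^2\,dx \\
&= \int \varphi^*(\hat A - i\lambda^*\hat B)(\hat A + i\lambda\hat B)\varphi\,dx \\
&= \int \varphi^*\hat A^2\varphi\,dx + |\lambda|^2\int \varphi^*\hat B^2\varphi\,dx + \int \varphi^*(\hat A\hat B\,i\lambda - \hat B\hat A\,i\lambda^*)\varphi\,dx \\
&= \overline{A^2} + (\alpha^2 + \beta^2)\,\overline{B^2} - \alpha\bar C - \beta\bar D .
\end{aligned}$$

With

$$\overline{B^2}\left[\alpha - \bar C/2\overline{B^2}\right]^2 = \alpha^2\overline{B^2} - \alpha\bar C + \bar C^2/4\overline{B^2}$$

and

$$\overline{B^2}\left[\beta - \bar D/2\overline{B^2}\right]^2 = \beta^2\overline{B^2} - \beta\bar D + \bar D^2/4\overline{B^2}$$

we now have

$$\overline{A^2} + \overline{B^2}\left(\alpha - \bar C/2\overline{B^2}\right)^2 + \overline{B^2}\left(\beta - \bar D/2\overline{B^2}\right)^2 - \bar C^2/4\overline{B^2} - \bar D^2/4\overline{B^2} \ge 0 .$$

But $\alpha$, $\beta$ can be chosen arbitrarily, i.e.

$$\overline{A^2}\;\overline{B^2} \ge \tfrac14\left[(\bar C)^2 + (\bar D)^2\right]$$

must hold, which was to be demonstrated.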


EXERCISE


4.6 The Difference Between Uncertainty Relations 


Problem. Discuss the “uncertainty relation”

$$\Delta E\,\Delta t \sim \hbar .$$

What is the fundamental difference compared with

$$\Delta x\,\Delta p \sim \hbar\,?$$

Solution. A (free) wave packet with a width $\Delta x$ in configuration space has
a distribution around a certain momentum $p_0$ with a width $\Delta p$ in momentum
space, where $\Delta x\,\Delta p \sim \hbar$ holds. Its group velocity is $v = \partial E/\partial p|_{p=p_0}$. The time at
which the particle passes a point $x_0$ is uncertain by $\Delta t = \Delta x/v$. On the other
hand, the particle has an uncertainty in energy,

$$\Delta E = \left.\frac{\partial E}{\partial p}\right|_{p=p_0}\Delta p = v\,\Delta p ,$$

because $E = E(p)$. Therefore we obtain

$$\hbar \sim \Delta x\,\Delta p = \Delta x\,\Delta E/v = \Delta t\,\Delta E .$$


This is the origin of the “energy-time uncertainty relation”. Therefore, if we want 
to measure the energy of a state with satisfactory accuracy, a sufficiently long 
time is needed. If this is not possible (for example, because of the finite “lifetime” 
of a state), the energy of the state remains uncertain. From this point of view, the 
uncertainty relations $\Delta p\,\Delta x \sim \hbar$ and $\Delta E\,\Delta t \sim \hbar$ are equivalent. But the physical
interpretation is entirely different. A measuring apparatus can measure a given 
observable of a physical system at different times (e.g. position, momentum or 
energy). Then the time is given by the hand of a macroscopic clock, which is 
connected with the apparatus. Hence this time is not an observable of the 
quantum-mechanical system itself, but a parameter, which is described by a real 
number t. 


EXERCISE


4.7 Expansion of an Operator 


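Let $f$ be a function $f(z)\colon \mathbb{C} \to \mathbb{C}$, which can be expanded into a Taylor series
$f(z) = \sum_{n=0}^{\infty} a_n z^n$. Then the operator $f(\hat A)$ can be defined by

$$f(\hat A) = \sum_{n=0}^{\infty} a_n \hat A^n$$

for an “appropriate” operator $\hat A$.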


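Problem. a) Why is this definition incomplete? What is actually the meaning of

$$\lim_{n\to\infty} \hat S_n = \hat S\,?$$

b) Prove that

$$\hat T(\boldsymbol a) = \exp(i\hat{\boldsymbol p}\cdot\boldsymbol a/\hbar) \quad\text{with}\quad \hat{\boldsymbol p} = -i\hbar\nabla$$

is the translation operator; i.e. we have for suitable functions $\psi(\boldsymbol x)$

$$\hat T(\boldsymbol a)\,\psi(\boldsymbol x) = \psi(\boldsymbol x + \boldsymbol a) .$$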


Solution. a) The definition is incomplete, since we have not explained under
what circumstances a sequence $\hat S_n\,(= \sum_{\nu=0}^{n} a_\nu\hat A^\nu)$ of operators, which may be
defined on the full Hilbert space $H$, converges to an operator $\hat S$. Unfortunately
there are several nonequivalent convergence notions concerning operators. On
the one hand we can say that $\hat S_n \to \hat S$ means that for arbitrary vectors $\varphi(x) \in H$
the functions (vectors) $\hat S_n\varphi(x) \to \hat S\varphi(x)$ converge in this Hilbert space. On the other
hand, we can assign a norm

$$\|\hat O\| = \sup_{\varphi(x)\in H}\frac{\|\hat O\varphi(x)\|}{\|\varphi(x)\|}$$

to an operator $\hat O$ and define that $\hat S_n \to \hat S$, if $\|\hat S_n - \hat S\| \to 0$ holds in $\mathbb{R}$. Here we
cannot go further into these problems (or others, like: What is a Hilbert space?
What actually is the momentum operator?). To us, an exhaustive study of
functional analysis seems to be indispensable for a mathematical comprehension
of quantum mechanics.

b) As indicated in a), we will not concern ourselves with mathematical 
“subtleties”. 

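Let $\psi(\boldsymbol x)$ be expandable in its Taylor series. Then we have:

$$\hat T(\boldsymbol a)\,\psi(\boldsymbol x) = \exp\left(i\frac{\hat{\boldsymbol p}\cdot\boldsymbol a}{\hbar}\right)\psi(\boldsymbol x) = \left[\sum_{n=0}^{\infty}\frac{1}{n!}(\boldsymbol a\cdot\nabla)^n\right]\psi(\boldsymbol x) = \psi(\boldsymbol x + \boldsymbol a) ,$$

as the penultimate expression is simply the shorthand notation for the Taylor
expansion of the function $\psi(\boldsymbol x + \boldsymbol a)$ at the point $\boldsymbol x$.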
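
For polynomial wave functions in one dimension the series $\sum_n (a\,d/dx)^n/n!$ terminates, so the statement of part b) can be verified exactly. The sketch below assumes a polynomial stored as a coefficient list (lowest power first) and uses exact rational arithmetic; `translate`, `derivative` and `evaluate` are hypothetical helper names.

```python
from fractions import Fraction
from math import factorial

def derivative(c):
    """Coefficients of the derivative of the polynomial sum_k c[k] x^k."""
    return [k * c[k] for k in range(1, len(c))]

def evaluate(c, x):
    return sum(ck * x ** k for k, ck in enumerate(c))

def translate(c, a):
    """Apply sum_n (a d/dx)^n / n! to the polynomial c; the sum is finite here."""
    out = [Fraction(0)] * len(c)
    term, n = list(c), 0
    while term:  # term holds the n-th derivative; it becomes empty after deg+1 steps
        for k, tk in enumerate(term):
            out[k] += Fraction(a) ** n / factorial(n) * tk
        term, n = derivative(term), n + 1
    return out

psi = [Fraction(1), Fraction(-2), Fraction(0), Fraction(3)]  # psi(x) = 1 - 2x + 3x^3
a = Fraction(1, 2)
shifted = translate(psi, a)
print(evaluate(shifted, Fraction(1, 3)), evaluate(psi, Fraction(1, 3) + a))
```

The two printed numbers agree, i.e. the operator series applied to $\psi$ reproduces $\psi(x + a)$. For non-polynomial functions the convergence questions raised in part a) reappear.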


EXAMPLE


4.8 Legendre Polynomials 


The so-called “special functions” of mathematical physics are solutions of 
specified linear differential equations of second order, which frequently recur. 
We will discuss a small selection of special functions in this and the following 
examples, namely the Legendre functions, the associated Legendre functions 
and the spherical harmonics. There are several possible ways to represent these 
functions: 


1. as special solutions of specified differential equations (Laplace equation,
   force-free Schrödinger equation),
2. by means of recurrence formulae; or
3. by means of a generating function (i.e. an $|\boldsymbol r - \boldsymbol r'|^{-1}$ expansion),

to mention only a few. A valuable aid to studying the Legendre functions
(polynomials) is the generating function; therefore we place it at the beginning of
our considerations.

The Legendre Polynomials and Their Generating Function


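We have frequently met the term $|\boldsymbol r - \boldsymbol r'|^{-1}$ in solving potential problems:

$$\frac{1}{|\boldsymbol r - \boldsymbol r'|} = \frac{1}{\sqrt{|\boldsymbol r|^2 + |\boldsymbol r'|^2 - 2|\boldsymbol r||\boldsymbol r'|\cos\vartheta}} . \tag{1}$$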


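Now we want to expand the root into a power series of the ratio $r$ to $r'$. To
that end, we designate $r_<$ as the smaller, and $r_>$ as the greater, of both values of
$r$ and $r'$. Then certainly $r_</r_> < 1$ holds and we obtain

$$\begin{aligned}
\frac{1}{\sqrt{r^2 + r'^2 - 2rr'\cos\vartheta}} &= \frac{1}{r_>}\,\frac{1}{\sqrt{1 - 2\dfrac{r_<}{r_>}\cos\vartheta + \left(\dfrac{r_<}{r_>}\right)^2}} \\
&= \frac{1}{r_>}\left[1 + \frac{r_<}{r_>}\cos\vartheta + \frac12(3\cos^2\vartheta - 1)\left(\frac{r_<}{r_>}\right)^2 + \cdots\right] .
\end{aligned} \tag{2}$$

The $\vartheta$-dependent coefficients appearing here define the Legendre polynomials:

$$\frac{1}{r_>}\,\frac{1}{\sqrt{1 - 2\dfrac{r_<}{r_>}\cos\vartheta + \left(\dfrac{r_<}{r_>}\right)^2}} = \frac{1}{r_>}\sum_{l=0}^{\infty}\left(\frac{r_<}{r_>}\right)^l P_l(\cos\vartheta) = \sum_{l=0}^{\infty}\frac{r_<^l}{r_>^{l+1}}P_l(\cos\vartheta) . \tag{3}$$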
If we write $\cos\vartheta = x$, we find for the $P_l(x)$ (see figure):

$$\begin{aligned}
P_0(x) &= 1 , \\
P_1(x) &= x , \\
P_2(x) &= \tfrac12(3x^2 - 1) , \\
P_3(x) &= \tfrac12(5x^3 - 3x) , \\
P_4(x) &= \tfrac18(35x^4 - 30x^2 + 3) ,
\end{aligned} \tag{4}$$

or generally according to Rodrigues’s formula, which we will prove in the next
section,

$$P_l(x) = \frac{1}{2^l l!}\frac{d^l}{dx^l}(x^2 - 1)^l . \tag{5}$$

[Figure: Legendre polynomials of lowest order]


Mathematical Properties of Legendre Polynomials 


We recognize that the Legendre polynomials can be introduced as expansion
coefficients of a power series²:

$$(1 - 2xt + t^2)^{-1/2} = \sum_{l=0}^{\infty} P_l(x)\,t^l \tag{6}$$

with $x = \cos\vartheta$ and $t = r_</r_>$.

We call the function $(1 - 2xt + t^2)^{-1/2}$ the generating function of the
Legendre polynomials. By means of the generating function, we now calculate
a recurrence formula for the Legendre polynomials. To that end we define

$$F(t, x) = (1 - 2xt + t^2)^{-1/2} = \sum_{l=0}^{\infty} P_l(x)\,t^l . \tag{7}$$

The first derivative with respect to $t$ yields

$$\frac{\partial F}{\partial t} = \frac{x - t}{1 - 2xt + t^2}\,F ,$$

i.e.

$$(1 - 2xt + t^2)\sum_{l=0}^{\infty} l\,P_l(x)\,t^{l-1} = (x - t)\sum_{l=0}^{\infty} P_l(x)\,t^l . \tag{8}$$

Comparing now the same power of $t$ on both sides of the equation, we easily
obtain

$$(l+1)P_{l+1} - (2l+1)\,x\,P_l + l\,P_{l-1} = 0 . \tag{9}$$
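
The recurrence (9), started from $P_0 = 1$ and $P_1 = x$, generates all higher Legendre polynomials. The following sketch builds them with exact rational coefficients so that the result can be compared with the list (4); the function name `legendre` and the coefficient-list representation (lowest power first) are choices made for this illustration.

```python
from fractions import Fraction

def legendre(l_max):
    """Coefficient lists of P_0 .. P_l_max from (l+1) P_{l+1} = (2l+1) x P_l - l P_{l-1}."""
    P = [[Fraction(1)], [Fraction(0), Fraction(1)]]
    for l in range(1, l_max):
        x_Pl = [Fraction(0)] + P[l]                       # multiply P_l by x
        prev = P[l - 1] + [Fraction(0)] * (len(x_Pl) - len(P[l - 1]))
        P.append([((2 * l + 1) * u - l * v) / (l + 1) for u, v in zip(x_Pl, prev)])
    return P[: l_max + 1]

P = legendre(4)
for l, coeffs in enumerate(P):
    print(l, [str(c) for c in coeffs])
```

The coefficient lists for $l = 2, 3, 4$ reproduce $\tfrac12(3x^2-1)$, $\tfrac12(5x^3-3x)$ and $\tfrac18(35x^4-30x^2+3)$, and each list sums to 1, which is the statement $P_l(1) = 1$.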
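A second recurrence formula may be obtained from $F(t, x)$ by differentiating with
respect to $x$:

$$(1 - 2xt + t^2)\frac{\partial F}{\partial x} = tF \tag{10}$$

and by an analogous procedure:

$$P'_{l+1} - 2xP'_l + P'_{l-1} = P_l , \tag{11}$$

with $' = \partial/\partial x$. From these recurrence formulae we easily find the relations:

$$\begin{aligned}
P'_{l+1} - xP'_l &= (l+1)P_l , \\
xP'_l - P'_{l-1} &= lP_l , \\
P'_{l+1} - P'_{l-1} &= (2l+1)P_l , \\
(x^2 - 1)P'_l &= lxP_l - lP_{l-1} .
\end{aligned} \tag{12}$$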
If we now inspect the generating function for $x = 1$, we find that

$$F(t, 1) = \frac{1}{1 - t} = \sum_{l=0}^{\infty} t^l = \sum_{l=0}^{\infty} P_l(1)\,t^l \tag{13}$$

and therefore

$$P_l(1) = 1 . \tag{14}$$

Analogously, we find for $x = 0$ that

$$F(t, 0) = \frac{1}{\sqrt{1 + t^2}} = 1 - \frac12 t^2 + \cdots = \sum_{l=0}^{\infty} P_l(0)\,t^l \tag{15}$$

and therefore

$$P_l(0) = \begin{cases} 0 & \text{for } l \text{ odd} \\[4pt] (-1)^{l/2}\,\dfrac{(l-1)!!}{2^{l/2}\,(l/2)!} & \text{for } l \text{ even.} \end{cases} \tag{16}$$

The double factorial, characterized by two exclamation marks, is the product
over the odd numbers, e.g. $7!! = 1\times3\times5\times7$.

² The series converges for $|t| < 1$ and $\vartheta \in [0, \pi]$ and can be differentiated term by term
arbitrarily often with respect to $t$ and $\vartheta$, whereby the series so obtained converges
uniformly with respect to $(t, \vartheta)$ in $[-t_0, t_0] \times [0, \pi]$ for arbitrary $|t_0| < 1$.
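Next we derive the so-called Rodrigues formula,

$$P_l(x) = \frac{1}{2^l l!}\frac{d^l}{dx^l}(x^2 - 1)^l . \tag{17}$$

To do so, we use (6) and find the representation

$$P_l(x) = \frac{1}{l!}\left.\frac{\partial^l}{\partial t^l}(1 - 2xt + t^2)^{-1/2}\right|_{t=0} . \tag{18}$$

Now we expand the generating function $(1 - 2xt + t^2)^{-1/2}$ of $P_l$ into powers
of $t$:

$$\begin{aligned}
(1 - 2xt + t^2)^{-1/2} &= \sum_n \binom{-1/2}{n}(-2xt)^n(1 + t^2)^{-1/2-n} \\
&= \sum_{n,m}\binom{-1/2}{n}\binom{-1/2-n}{m}(-2x)^n\,t^{n+2m} .
\end{aligned} \tag{19}$$

Therefrom we obtain

$$\frac{\partial^l}{\partial t^l}(1 - 2xt + t^2)^{-1/2} = \sum_{n,m}\binom{-1/2}{n}\binom{-1/2-n}{m}\frac{(n+2m)!}{(n-l+2m)!}(-2x)^n\,t^{n+2m-l} .$$

Here the sum contains only those terms for which $n + 2m \ge l$ holds. For $t = 0$,
only a contribution from the terms of the sum with $m = (l - n)/2$ remains. Thus
we obtain

$$\begin{aligned}
P_l(x) &= \sum_n \binom{-1/2}{n}\binom{-1/2-n}{(l-n)/2}(-2x)^n \\
&= \sum_n (-1)^{(1/2)(l-n)}\,\frac{(l+n)!}{2^l\,n!\left(\frac{l+n}{2}\right)!\left(\frac{l-n}{2}\right)!}\,x^n .
\end{aligned} \tag{20}$$

If $n = 2m - l$ is inserted, we get

$$P_l(x) = \frac{1}{2^l l!}\sum_m (-1)^{l-m}\frac{l!}{m!\,(l-m)!}\frac{(2m)!}{(2m-l)!}\,x^{2m-l} = \frac{1}{2^l l!}\frac{d^l}{dx^l}\sum_m \binom{l}{m}(-1)^{l-m}x^{2m} .$$

By means of the binomial theorem, we now can easily see that

$$P_l(x) = \frac{1}{2^l l!}\frac{d^l}{dx^l}(x^2 - 1)^l \tag{21}$$

holds, which was to be demonstrated.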

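From Rodrigues’s formula (17) immediately follows a symmetry of the
Legendre polynomials:

$$P_l(-x) = (-1)^l P_l(x) . \tag{22}$$

A further important property of Legendre polynomials is their orthogonality.
Thus we consider

$$\begin{aligned}
I_{mn} &= \int_{-1}^{+1} P_m(x)P_n(x)\,dx \qquad (m \le n) \\
&= \frac{1}{2^{m+n}\,m!\,n!}\int_{-1}^{+1}\left[\frac{d^m}{dx^m}(x^2-1)^m\right]\left[\frac{d^n}{dx^n}(x^2-1)^n\right]dx .
\end{aligned} \tag{23}$$

Partial integration yields

$$I_{mn} = \frac{(-1)^n}{2^{m+n}\,m!\,n!}\int_{-1}^{+1}\left[\frac{d^{m+n}}{dx^{m+n}}(x^2-1)^m\right](x^2-1)^n\,dx . \tag{24}$$

From this we obtain $I_{mn} = 0$ for $m < n$, since

$$\frac{d^{m+n}}{dx^{m+n}}(x^2-1)^m = 0 . \tag{25}$$

For the case $m = n$ we get

$$\begin{aligned}
I_{nn} &= \frac{(-1)^n}{2^{2n}(n!)^2}\int_{-1}^{+1}\left[\frac{d^{2n}}{dx^{2n}}(x^2-1)^n\right](x^2-1)^n\,dx \\
&= \frac{(-1)^n}{2^{2n}(n!)^2}\int_{-1}^{+1}(x^2-1)^n\,2n(2n-1)(2n-2)\cdots[2n-(2n-1)]\,dx \\
&= \frac{(-1)^n(2n)!}{2^{2n}(n!)^2}\int_{-1}^{+1}(x^2-1)^n\,dx .
\end{aligned} \tag{26}$$

By means of the variable transformation $x = 2u - 1$, we find

$$I_{nn} = \frac{(-1)^n\,2\,(2n)!}{(n!)^2}\int_0^1 u^n(u-1)^n\,du = \frac{2}{2n+1} . \tag{27}$$

If we combine the two results, we obtain

$$\int_{-1}^{+1} P_m(x)P_n(x)\,dx = \frac{2}{2n+1}\,\delta_{mn} . \tag{28}$$

These are the orthogonality relations for Legendre polynomials.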
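
Because the $P_n$ are polynomials with rational coefficients, the orthogonality relations (28) can be checked exactly rather than by numerical quadrature. The sketch below regenerates the polynomials from the recurrence (9) and integrates the products term by term; `legendre` and `integrate_product` are hypothetical helper names, and the check is limited to $n \le 5$ by choice.

```python
from fractions import Fraction

def legendre(l_max):
    """Coefficient lists (lowest power first) of P_0 .. P_l_max via recurrence (9)."""
    P = [[Fraction(1)], [Fraction(0), Fraction(1)]]
    for l in range(1, l_max):
        x_Pl = [Fraction(0)] + P[l]
        prev = P[l - 1] + [Fraction(0)] * (len(x_Pl) - len(P[l - 1]))
        P.append([((2 * l + 1) * u - l * v) / (l + 1) for u, v in zip(x_Pl, prev)])
    return P[: l_max + 1]

def integrate_product(p, q):
    """Exact integral over [-1, 1] of the product of two polynomials."""
    prod = [Fraction(0)] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            prod[i + j] += a * b
    # The integral of x^k over [-1, 1] is 2/(k+1) for even k and 0 for odd k.
    return sum(c * Fraction(2, k + 1) for k, c in enumerate(prod) if k % 2 == 0)

P = legendre(5)
gram = [[integrate_product(P[m], P[n]) for n in range(6)] for m in range(6)]
for row in gram:
    print([str(v) for v in row])
```

The printed matrix is diagonal with entries $2/(2n+1)$, as stated in (28).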

As a further point of interest, directly demonstrable by taking the
orthogonality relation as a basis, every function $f(x)$, which is continuous and
bounded in the interval $-1 \le x \le 1$, can be expanded in a series of Legendre
polynomials:

$$f(x) = \sum_{n=0}^{\infty} c_n P_n(x) . \tag{29}$$

The function $f(x)$ can be expanded into $P_n(x)$ if the coefficients $c_n$ of the
expansion can be found uniquely. To that end we calculate

$$\int_{-1}^{+1} P_m(x)f(x)\,dx = \sum_{n=0}^{\infty} c_n\int_{-1}^{+1} P_m(x)P_n(x)\,dx = \sum_{n=0}^{\infty} c_n\frac{2}{2n+1}\,\delta_{mn} = \frac{2}{2m+1}\,c_m \tag{30a}$$

and thus

$$c_m = \frac{2m+1}{2}\int_{-1}^{+1} P_m(x)f(x)\,dx = \frac{(-1)^m(2m+1)}{2^{m+1}\,m!}\int_{-1}^{+1}(x^2-1)^m\frac{d^m f(x)}{dx^m}\,dx . \tag{30b}$$

In particular it can be immediately shown that

$$\delta(x - x') = \sum_{n=0}^{\infty}\frac{2n+1}{2}\,P_n(x')P_n(x) \tag{31}$$

holds. Hereby, the completeness of Legendre polynomials was the requirement
for this procedure. In fact, according to Weierstrass’s approximation theorem³,
every function $f(x)$, which is continuous on a compact interval, can be uniformly
approximated by polynomials, i.e. the set of functions $\{1, x, x^2, \ldots\}$ is complete
on this interval. The Legendre polynomials are obtained by application of
E. Schmidt’s orthogonalization procedure, which of course in no way affects the
completeness.
The Legendre polynomials are solutions of the so-called Legendre
differential equation:

$$(1 - x^2)\frac{d^2}{dx^2}P_n(x) - 2x\frac{d}{dx}P_n(x) + n(n+1)P_n(x) = 0 . \tag{32}$$

³ See E. Isaacson, H.B. Keller: Analysis of Numerical Methods (Wiley, New York 1966)
Chap. 5.

Now, with the aid of recurrence formulae (12) derived earlier, the above
proposition can easily be proved:

$$\begin{aligned}
(x^2 - 1)\frac{d}{dx}P_n(x) &= nxP_n(x) - nP_{n-1}(x) , \\
2x\frac{d}{dx}P_n(x) + (x^2 - 1)\frac{d^2}{dx^2}P_n(x) &= nP_n(x) + nx\frac{d}{dx}P_n(x) - n\frac{d}{dx}P_{n-1}(x) .
\end{aligned} \tag{33}$$

Thus we obtain

$$(1 - x^2)\frac{d^2}{dx^2}P_n(x) = (2 - n)\,x\frac{d}{dx}P_n(x) - nP_n(x) + n\frac{d}{dx}P_{n-1}(x) . \tag{34}$$

Insertion into the differential equation yields

$$0 = 2x\frac{d}{dx}P_n(x) - nx\frac{d}{dx}P_n(x) - nP_n(x) + n\frac{d}{dx}P_{n-1}(x) - 2x\frac{d}{dx}P_n(x) + n(n+1)P_n(x) \tag{35}$$

and finally

$$x\frac{d}{dx}P_n(x) - \frac{d}{dx}P_{n-1}(x) = nP_n(x) , \tag{36}$$

i.e. one of the first recurrence formulae in (12), which proves the above
proposition.

We do not obtain the Legendre differential equation from the Laplace
equation in spherical coordinates after a separation of variables, but the
associated Legendre differential equation (for the angle $\vartheta$, $x = \cos\vartheta$):

$$(1 - x^2)\frac{d^2P(x)}{dx^2} - 2x\frac{dP(x)}{dx} + \left[n(n+1) - \frac{m^2}{1 - x^2}\right]P(x) = 0 . \tag{37}$$

For the case $m = 0$ this differential equation transforms into the Legendre
differential equation. The solutions of (37) that remain regular at $x = \pm1$ are the
associated Legendre polynomials $P_n^m(x)$:

$$\begin{aligned}
P_n^m(x) &= (1 - x^2)^{m/2}\frac{d^m}{dx^m}P_n(x) \qquad\qquad (38) \\
&= \frac{1}{2^n n!}(1 - x^2)^{m/2}\frac{d^{n+m}}{dx^{n+m}}(x^2 - 1)^n . \qquad (39)
\end{aligned}$$

The following orthogonality relation ($n$ is replaced by $l$ here) can easily be
derived:

$$\int_{-1}^{+1} P_l^m(x)P_{l'}^m(x)\,dx = \frac{2}{2l+1}\,\frac{(l+m)!}{(l-m)!}\,\delta_{ll'} . \tag{40}$$

Usually every function of an orthogonal system is furnished with a factor so as
to yield the value one for the integral over the square of each function. We then
say that these functions are normalized. A system of normalized orthogonal
functions is termed orthonormal. We easily derive from (40) the normalization
factor

$$\left[\frac{(2l+1)(l-m)!}{2\,(l+m)!}\right]^{1/2} . \tag{41}$$

It will prove useful to define the associated Legendre polynomials for negative
values of $m$, too. Since the differential equation (37) transforms into itself when
$m$ is replaced by $-m$,

$$P_l^{-m}(x) = \frac{1}{2^l l!}(1 - x^2)^{-m/2}\frac{d^{l-m}}{dx^{l-m}}(x^2 - 1)^l$$

is also a solution of the general Legendre differential equation. This solution is
a polynomial in $x$ of order $l$ and is continuous for $x = \pm1$, too. Therefore the
solutions $P_l^m$ and $P_l^{-m}$ can differ only by a factor for fixed $l$ and $m$ $(0 \le m \le l)$:

$$P_l^{-m} = A\,P_l^m . \tag{42}$$

We now determine the constant $A$ by dividing (42) by $(1 - x^2)^{m/2}$ and setting $x = 1$.
With

$$\left.(1 - x^2)^{-m}\frac{d^{l-m}}{dx^{l-m}}(x^2 - 1)^l\right|_{x=1} = (-1)^m\frac{l!}{m!}\,2^{l-m} , \qquad \left.\frac{d^{l+m}}{dx^{l+m}}(x^2 - 1)^l\right|_{x=1} = \frac{l!\,(l+m)!}{m!\,(l-m)!}\,2^{l-m}$$

we find

$$A = (-1)^m\frac{(l-m)!}{(l+m)!} ,$$

and therefore

$$P_l^{-m}(x) = (-1)^m\frac{(l-m)!}{(l+m)!}\,P_l^m(x) . \tag{43}$$


In the next example we will introduce spherical harmonics. To that end, we
make a short digression to the Laplace equation. This digression will help us to
understand better the physical significance of the differential equation discussed
before. Furthermore, we will see that in order to construct the spherical
harmonics, in addition to the Legendre polynomials $P_n(x)$, the associated Legendre
polynomials $P_l^m(x)$ are needed, too.

EXAMPLE


4.9 Mathematical Supplement: Spherical Harmonics 


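The Laplace Equation in Spherical Coordinates. A scalar potential $U$, outside
a charge distribution, satisfies the Laplace equation⁴

$$\Delta U(x, y, z) = \left(\frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2}\right)U(x, y, z) = 0 . \tag{1}$$

If spherical coordinates $(r, \vartheta, \varphi)$ with

$$x = r\sin\vartheta\cos\varphi , \quad y = r\sin\vartheta\sin\varphi , \quad z = r\cos\vartheta \tag{2}$$

are introduced, we find:

$$\Delta U = \left[\frac{\partial^2}{\partial r^2} + \frac{2}{r}\frac{\partial}{\partial r} + \frac{1}{r^2}\left(\frac{\partial^2}{\partial\vartheta^2} + \cot\vartheta\frac{\partial}{\partial\vartheta} + \frac{1}{\sin^2\vartheta}\frac{\partial^2}{\partial\varphi^2}\right)\right]U = 0 . \tag{3}$$

This can be further simplified to

$$\left[\frac{1}{r}\frac{\partial^2}{\partial r^2}\,r + \frac{1}{r^2\sin\vartheta}\frac{\partial}{\partial\vartheta}\left(\sin\vartheta\frac{\partial}{\partial\vartheta}\right) + \frac{1}{r^2\sin^2\vartheta}\frac{\partial^2}{\partial\varphi^2}\right]U = 0 . \tag{4}$$

This differential operator divides into a radial part and an angular part $L^2$:

$$\left[\frac{1}{r}\frac{\partial^2}{\partial r^2}\,r - \frac{L^2}{r^2}\right]U(r, \vartheta, \varphi) = 0 , \quad\text{with} \tag{5}$$

$$L^2 = -\left[\frac{1}{\sin\vartheta}\frac{\partial}{\partial\vartheta}\left(\sin\vartheta\frac{\partial}{\partial\vartheta}\right) + \frac{1}{\sin^2\vartheta}\frac{\partial^2}{\partial\varphi^2}\right] . \tag{6}$$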

We now state that $L^2$ is proportional (up to the factor $\hbar^2$) to the square of the
well-known angular-momentum operator [see (4.73–75)]

$$\boldsymbol L = -i(\boldsymbol r\times\nabla) . \tag{7}$$

We have been directed automatically towards this operator in the course of this
chapter [(4.64) ff.]. Here we interpret it solely as a mathematical tool, which
allows us to formulate some operations more concisely.

⁴ In the following we will introduce spherical coordinates as we did in (4.73–75) for the
angular momentum operator and draw attention to the fact that we already know the
$(\vartheta, \varphi)$-dependent part of the differential equation from (4.73–75) on. It will be considered
again in the discussion of the “hydrogen problem” [cf. (9.11)]. See also Chaps. on
potential theory in J.D. Jackson: Classical Electrodynamics, 2nd ed. (Wiley, New York
1975) and W. Greiner: Theoretische Physik, Band 3, 4. Aufl. (Harri Deutsch, Thun und
Frankfurt a.M. 1986).

The operator $\boldsymbol L$ acts only on the angles $\vartheta$ and $\varphi$. It has the components

$$\begin{aligned}
L_x &= -i\left(y\frac{\partial}{\partial z} - z\frac{\partial}{\partial y}\right) = i\left(\sin\varphi\frac{\partial}{\partial\vartheta} + \cos\varphi\cot\vartheta\frac{\partial}{\partial\varphi}\right) , \\
L_y &= -i\left(z\frac{\partial}{\partial x} - x\frac{\partial}{\partial z}\right) = i\left(-\cos\varphi\frac{\partial}{\partial\vartheta} + \sin\varphi\cot\vartheta\frac{\partial}{\partial\varphi}\right) , \\
L_z &= -i\left(x\frac{\partial}{\partial y} - y\frac{\partial}{\partial x}\right) = -i\frac{\partial}{\partial\varphi} .
\end{aligned} \tag{8}$$

We can easily verify that

$$L^2 = L_x^2 + L_y^2 + L_z^2 \tag{9}$$

holds [see (4.75)]. Earlier in this chapter we became acquainted with the
angular-momentum operators $\hat L_i$. The operators $L_i$ used here differ by a factor $\hbar$ from
those operators $\hat L_i$, i.e. $\hat L_i = \hbar L_i$.⁵
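In order to solve the Laplace equation we use a separation of variables
procedure:

$$U(r, \vartheta, \varphi) = R(r)\,\Theta(\vartheta, \varphi) . \tag{10}$$

Thereby we obtain

$$\frac{r(\partial^2/\partial r^2)(rR(r))}{R(r)} = \frac{L^2\,\Theta(\vartheta, \varphi)}{\Theta(\vartheta, \varphi)} = l(l+1) , \tag{11}$$

where we have chosen the factor $l(l+1)$ as separation constant without
restriction of generality. We now have

$$r\frac{\partial^2}{\partial r^2}(rR(r)) = l(l+1)\,R(r) \quad\text{and} \tag{12}$$

$$L^2\,\Theta(\vartheta, \varphi) = l(l+1)\,\Theta(\vartheta, \varphi) . \tag{13}$$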


If now we choose

$$\Theta = P(\vartheta)\,E(\varphi) \tag{14}$$

for the angular part, we find with the new separation constant $m^2$:

$$\frac{1}{\sin\vartheta}\frac{(d/d\vartheta)[\sin\vartheta\,(d/d\vartheta)]P(\vartheta)}{P(\vartheta)} - \frac{m^2}{\sin^2\vartheta} = -l(l+1) , \qquad \frac{(\partial^2/\partial\varphi^2)E(\varphi)}{E(\varphi)} = -m^2 . \tag{15}$$

A solution of the $\varphi$-dependent part is

$$E(\varphi) = c\,e^{im\varphi} ; \quad m \in \mathbb{Z} , \tag{16}$$

where we required that $E(\varphi)$ be periodic with $2\pi$, corresponding to the chosen
symmetry. For the radial equation one obtains

$$R(r) = c_1 r^l + c_2 r^{-l-1} . \tag{17}$$

⁵ The algebra of angular-momentum operators is investigated in detail in W. Greiner,
B. Müller: Quantum Mechanics – Symmetries, 2nd rev. ed. (Springer, Berlin, Heidelberg,
New York 1994)


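We find for the $\vartheta$-dependent part with the abbreviations

$$\cos\vartheta = x \quad\text{and}\quad \sin\vartheta = \sqrt{1 - x^2} ,$$

$$\frac{\partial}{\partial\vartheta} = -\sin\vartheta\frac{\partial}{\partial x} = -\sqrt{1 - x^2}\,\frac{\partial}{\partial x} , \qquad \frac{\partial^2}{\partial\vartheta^2} = (1 - x^2)\frac{\partial^2}{\partial x^2} - x\frac{\partial}{\partial x} , \tag{18}$$

the transformed differential equation (associated Legendre differential equation),

$$\left[(1 - x^2)\frac{d^2}{dx^2} - 2x\frac{d}{dx} + l(l+1) - \frac{m^2}{1 - x^2}\right]P = 0 \tag{19}$$

with the associated Legendre polynomials $P_l^m(\cos\vartheta)$ as solutions (cf. Example
4.8). Thus we obtain

$$\Theta(\vartheta, \varphi) = C_l^m P_l^m(\cos\vartheta)\,e^{im\varphi} , \qquad l = 0, 1, 2, \ldots ; \quad -l \le m \le l . \tag{20}$$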


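Concerning the fact that $l$ is an integer, we refer the reader to Example 4.8. The
functions thus characterized by two integers $l$ and $m$ are the spherical harmonics

$$\Theta(\vartheta, \varphi) = Y_{lm}(\vartheta, \varphi) = C_l^m P_l^m(\cos\vartheta)\,e^{im\varphi} . \tag{21}$$

In general the constant $C_l^m$ is fixed in such a way that the spherical harmonics
are normalized. To that end we recall that

$$\int_{-1}^{+1} dx'\,P_l^m(x')P_{l'}^m(x') = \frac{2}{2l+1}\,\frac{(l+m)!}{(l-m)!}\,\delta_{ll'} \tag{22}$$

holds [cf. (38–40) in the preceding example] and calculate easily

$$\int_0^{2\pi} d\varphi\,e^{i(m-m')\varphi} = 2\pi\,\delta_{mm'} . \tag{23}$$

Then we have

$$\int d\Omega\,Y^*_{lm}(\vartheta, \varphi)\,Y_{l'm'}(\vartheta, \varphi) = |C_l^m|^2\,\frac{4\pi}{2l+1}\,\frac{(l+m)!}{(l-m)!}\,\delta_{ll'}\,\delta_{mm'} . \tag{24}$$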

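Customarily, in the literature, the normalization constant $C_l^m$ is fixed so that

$$\int d\Omega\,Y^*_{lm}(\vartheta, \varphi)\,Y_{l'm'}(\vartheta, \varphi) = \delta_{ll'}\,\delta_{mm'} \tag{25}$$

holds and thus

$$C_l^m = \sqrt{\frac{2l+1}{4\pi}\,\frac{(l-m)!}{(l+m)!}} , \tag{26}$$

$$Y_{lm}(\vartheta, \varphi) = \sqrt{\frac{2l+1}{4\pi}\,\frac{(l-m)!}{(l+m)!}}\;P_l^m(\cos\vartheta)\,e^{im\varphi} . \tag{27}$$

We point out to the reader that for negative $m$,

$$P_l^{-m}(\cos\vartheta) = (-1)^m\frac{(l-m)!}{(l+m)!}\,P_l^m(\cos\vartheta) \tag{28}$$

holds. Thus we find a symmetry of the spherical harmonics:

$$Y_{l,-m}(\vartheta, \varphi) = (-1)^m\,Y^*_{lm}(\vartheta, \varphi) . \tag{29}$$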


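One of the most important properties of spherical harmonics is that every
bounded function $f(\vartheta, \varphi)$ defined on the surface of a sphere can be expanded in
a series of the $Y_{lm}(\vartheta, \varphi)$:

$$f(\vartheta, \varphi) = \sum_{l=0}^{\infty}\sum_{m=-l}^{l} d_l^m\,Y_{lm}(\vartheta, \varphi) . \tag{30}$$

Using the orthonormality of the $Y_{lm}(\vartheta, \varphi)$, we determine the expansion
coefficients $d_l^m$:

$$d_l^m = \int_\Omega d\Omega\,f(\vartheta, \varphi)\,Y^*_{lm}(\vartheta, \varphi) . \tag{31}$$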


EXAMPLE


4.10 The Addition Theorem of Spherical Harmonics 


In the following we shall prove that

$$P_l(\cos\gamma) = \frac{4\pi}{2l+1}\sum_{m=-l}^{l} Y^*_{lm}(\vartheta', \varphi')\,Y_{lm}(\vartheta, \varphi) \tag{1}$$

holds, where

$$\cos\gamma = \cos\vartheta\cos\vartheta' + \sin\vartheta\sin\vartheta'\cos(\varphi - \varphi') .$$

This relation is termed the addition theorem of spherical harmonics. In order to
prove it, we expand a Legendre polynomial of order $l$ for the angle $\gamma$ in spherical
harmonics. Let $\boldsymbol x'$ be fixed in space. Then $P_l(\cos\gamma)$ is only a function of $\vartheta$, $\varphi$ with
$\vartheta'$, $\varphi'$ as parameters. Therefore $P_l(\cos\gamma)$ can be expanded:

$$P_l(\cos\gamma) = \sum_{l'=0}^{\infty}\sum_{m=-l'}^{l'} A_{l'm}(\vartheta', \varphi')\,Y_{l'm}(\vartheta, \varphi) . \tag{2}$$

Comparison with (1) shows that only terms with $l' = l$ seem to appear. In order
to understand why this is so, we assume that $\boldsymbol x'$ coincides with the z axis. Then
$P_l(\cos\gamma)$ satisfies the equation [cf. Example 4.9, Eqs. (5, 13, 20)]

$$\nabla'^2 P_l(\cos\gamma) + \frac{l(l+1)}{r^2}\,P_l(\cos\gamma) = 0 \tag{3}$$

(differential equation for spherical harmonics, $\gamma$ is the usual polar angle), which
can be checked easily if $\nabla'^2$ is written in spherical coordinates. If we now rotate
the vector $\boldsymbol x'$ to its old position, then $\nabla'^2$ passes to $\nabla^2$, and $r$ remains unchanged
($\nabla\cdot\nabla$ is a scalar product and therefore rotationally invariant). Therefore $P_l$ still
satisfies (3); consequently, $P_l$ is itself a spherical harmonic of order $l$. Thus $P_l$ can
only be represented as a linear combination of $Y_{lm}$ of the same order $l$, and the
expansion (2) reduces to:

$$P_l(\cos\gamma) = \sum_{m=-l}^{l} A_{lm}(\vartheta', \varphi')\,Y_{lm}(\vartheta, \varphi) \tag{4}$$

with

$$A_{lm}(\vartheta', \varphi') = \int_\Omega Y^*_{lm}(\vartheta, \varphi)\,P_l(\cos\gamma)\,d\Omega . \tag{5}$$


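Now we want to determine the coefficients $A_{lm}$. To that end we examine the
$Y_{lm}(\vartheta, \varphi)$ by expanding them into a linear combination of spherical
harmonics with angles $\gamma$ and $\beta$:

$$Y_{lm}(\vartheta, \varphi) = \sum_{m'=-l}^{l} c_{lm'}\,Y_{lm'}(\gamma, \beta) , \tag{6}$$

with

$$c_{lm'} = \int_\Omega Y_{lm}(\vartheta, \varphi)\,Y^*_{lm'}(\gamma, \beta)\,d\Omega . \tag{7}$$

Here, $\gamma$ and $\beta$ are functions of $\vartheta$, $\vartheta'$, $\varphi$, $\varphi'$. If now we choose $m' = 0$, we obtain

$$Y_{l0}(\gamma, \beta) = \left(\frac{2l+1}{4\pi}\right)^{1/2} P_l(\cos\gamma) ; \tag{8}$$

then

$$c_{l0} = \left(\frac{2l+1}{4\pi}\right)^{1/2}\int_\Omega P_l(\cos\gamma)\,Y_{lm}(\vartheta, \varphi)\,d\Omega . \tag{9}$$

Comparison with (5) yields, since $P_l$ is real,

$$A^*_{lm}(\vartheta', \varphi') = \left(\frac{4\pi}{2l+1}\right)^{1/2} c_{l0} . \tag{10}$$

We now seek an equation for the $c_{l0}$ and therefore inspect (6) for $\gamma = 0$:

$$Y_{lm}(\vartheta, \varphi)\big|_{\gamma=0} = \sum_{m'=-l}^{l} c_{lm'}\,Y_{lm'}(0, \beta) , \quad\text{with}\quad (\vartheta, \varphi) = [\vartheta(\gamma, \beta), \varphi(\gamma, \beta)] . \tag{11}$$

Because $Y_{lm'}(0, \beta)$ vanishes for $m' \ne 0$ and $P_l(1) = 1$, we obtain for $\gamma = 0$:

$$Y_{lm}(\vartheta, \varphi)\big|_{\gamma=0} = \left(\frac{2l+1}{4\pi}\right)^{1/2} c_{l0} , \tag{12}$$

$$c_{l0} = \left(\frac{4\pi}{2l+1}\right)^{1/2} Y_{lm}[\vartheta(\gamma, \beta), \varphi(\gamma, \beta)]\big|_{\gamma=0} . \tag{13}$$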
If we insert this into (10),

$$A_{lm}(\vartheta', \varphi') = \frac{4\pi}{2l+1}\,Y^*_{lm}(\vartheta', \varphi') \tag{14}$$

follows, since $\vartheta$ and $\varphi$ pass to $\vartheta'$ and $\varphi'$ for $\gamma \to 0$. Thus proposition (1) is proved
if we insert this result into the expansion (4). Often it
turns out to be more advantageous to express (1) in terms of the $P_l^m$. If we bear in
mind that

$$P_l^{-m}(\cos\vartheta) = (-1)^m\frac{(l-m)!}{(l+m)!}\,P_l^m(\cos\vartheta) , \tag{15}$$

$$P_l(\cos\gamma) = P_l(\cos\vartheta)P_l(\cos\vartheta') + 2\sum_{m=1}^{l}\frac{(l-m)!}{(l+m)!}\,P_l^m(\cos\vartheta)\,P_l^m(\cos\vartheta')\cos[m(\varphi - \varphi')] \tag{16}$$

results. For $\gamma = 0$, we find a formula concerning the squares of $Y_{lm}$:

$$\sum_{m=-l}^{l}|Y_{lm}(\vartheta, \varphi)|^2 = \frac{2l+1}{4\pi} . \tag{17}$$

Here we have used (14) from Example 4.8. The properties of the spherical
harmonics derived here are extraordinarily important. We will encounter the
spherical harmonics $Y_{lm}$ time and again and come to appreciate them.
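
The form (16) of the addition theorem lends itself to a quick floating-point check. The sketch below is an illustration under stated assumptions: $P_l$ is generated from the recurrence (9) of Example 4.8, $P_l^m$ is taken as $(1-x^2)^{m/2}\,d^mP_l/dx^m$ for $0 \le m \le l$ as in (38) of that example, and the angles are arbitrary sample values in radians. All function names are hypothetical.

```python
import math
from fractions import Fraction

def legendre_coeffs(l):
    """Coefficients of P_l (lowest power first) from (k+1) P_{k+1} = (2k+1) x P_k - k P_{k-1}."""
    prev, cur = [Fraction(1)], [Fraction(0), Fraction(1)]
    if l == 0:
        return prev
    for k in range(1, l):
        x_cur = [Fraction(0)] + cur
        padded = prev + [Fraction(0)] * (len(x_cur) - len(prev))
        prev, cur = cur, [((2 * k + 1) * u - k * v) / (k + 1)
                          for u, v in zip(x_cur, padded)]
    return cur

def assoc_legendre(l, m, x):
    """P_l^m(x) = (1 - x^2)^(m/2) d^m P_l / dx^m for 0 <= m <= l."""
    c = legendre_coeffs(l)
    for _ in range(m):
        c = [k * c[k] for k in range(1, len(c))]
    poly = sum(float(ck) * x ** k for k, ck in enumerate(c))
    return max(0.0, 1.0 - x * x) ** (m / 2) * poly

def addition_lhs(l, th1, ph1, th2, ph2):
    """P_l(cos gamma) with cos gamma as defined below (1)."""
    cos_gamma = (math.cos(th1) * math.cos(th2)
                 + math.sin(th1) * math.sin(th2) * math.cos(ph1 - ph2))
    return assoc_legendre(l, 0, cos_gamma)

def addition_rhs(l, th1, ph1, th2, ph2):
    """Right-hand side of (16)."""
    x1, x2 = math.cos(th1), math.cos(th2)
    total = assoc_legendre(l, 0, x1) * assoc_legendre(l, 0, x2)
    for m in range(1, l + 1):
        w = math.factorial(l - m) / math.factorial(l + m)
        total += (2 * w * assoc_legendre(l, m, x1) * assoc_legendre(l, m, x2)
                  * math.cos(m * (ph1 - ph2)))
    return total

samples = [(0.3, 0.4, 1.1, 2.0), (1.2, -0.7, 2.5, 0.9), (2.9, 3.0, 0.1, -1.5)]
max_error = max(abs(addition_lhs(l, *s) - addition_rhs(l, *s))
                for l in range(6) for s in samples)
print("largest deviation between both sides of (16):", max_error)
```

For identical directions ($\gamma = 0$) the left-hand side reduces to $P_l(1) = 1$, which is the content of the sum rule (17).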
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HERMITE, Charles, French mathematician, * Dieuze 24.12.1822, + Paris 14.1.1902. H. 
grew up in a comfortable, bourgeois family as the son of a textile merchant. At an early
age he became so involved in research that he had difficulties passing his obligatory
examinations. He studied for just one year at the Ecole Polytechnique and only with help
of friends became qualified to teach in 1847. His scientific results, mostly on elliptic
functions, modular functions, theory of numbers and invariant theory, were recognized
only late. H. coordinated the ideas of Gaussian arithmetic, Abel’s and Jacobi’s elliptic
functions and Cayley’s and Sylvester’s algebraic invariant theory and developed them
further. Not until 1870 did he become a professor at the Sorbonne. In 1873 he proved
the transcendency of e. He was in correspondence with many famous contemporaries and
was determined to break down national barriers in the scientific struggle. He was
a teacher and promoter of Stieltjes, Darboux, Borel, Poincaré and others.


SCHMIDT, Erhard, German mathematician, * Dorpat (Tartu), † Berlin 6.12.1959.
S. studied in Berlin and Göttingen, graduated in 1905 and became a professor in Zürich
in 1908, 1909 in Erlangen, 1911 in Breslau (Wroclaw) and 1917 in Berlin. He worked
primarily on integral equations and isoperimetric problems.


WEYL, Claus Hugo Hermann, German mathematician, * Elmshorn 9.11.1885,
† Zürich 9.12.1955, became a professor at the ETH Zürich in 1913, 1930 in Göttingen;
went to the Institute for Advanced Study in Princeton, USA in 1933. After working on
the theory of differential and integral equations, W. connected topological considerations
with the concept of Riemann surfaces and made great progress in the theory of
uniformization. His fundamental publication, Raum, Zeit, Materie (1918, 1961, lectures),
originated from meetings with A. Einstein. He developed an integral method for
the representation of mathematical groups used in quantum mechanics, which contrasted
with the infinitesimal method used by S. Lie and E. Cartan. W. supported intuitionism
(a method for a constructive foundation of mathematics) and tried to closely combine
mathematics, physics and philosophy in his own work. He was the first to consider local
gauge invariance as a general principle in theoretical physics.


LEGENDRE, Adrien Marie, French mathematician, * Paris 18.9.1752, † Paris
10.1.1833. L. greatly shared in the foundation and development of number theory and
geodesy. He also made important contributions to elliptic integrals, to the foundations
and methods of Euclidean geometry, to the calculus of variations and to theoretical astronomy;
for example, he first applied the method of least squares and calculated extensive tables.
L. concerned himself with many problems in which Gauss was interested, too, but never 
reached the latter’s perfection. From 1775 on, L. worked as a professor at various 
Parisian universities and published outstanding, influential textbooks. 


DIRAC, Paul Adrien Maurice, * Bristol 8.8.1902, † Tallahassee 20.10.1984. D. studied at Bristol,
Cambridge and at several foreign universities. He was appointed professor of mathematics
in 1932. D. is one of the founders of quantum mechanics. The mathematical equivalent
created by him consists essentially of a noncommutative algebra for the calculation of
the properties of the electron; he predicted the existence of the positron in 1928 and
contributed fundamentally to quantum field theory. D. was awarded the Nobel prize
in 1933.


5. Mathematical Supplement 


5.1 Eigendifferentials and the Normalization 
of Eigenfunctions for Continuous Spectra 


We begin our discussion with the eigenvalue equation

$$\hat{L}\,\psi(x, L) = L\,\psi(x, L) , \tag{5.1}$$

which is supposed to have a continuous spectrum with the eigenvalues $L$ and the
eigenfunctions $\psi(x, L)$. Now we integrate (5.1) with respect to $L$ over the small
interval $\Delta L$, and obtain

$$\hat{L}\,\Delta\psi(x, L) = \int_{L}^{L+\Delta L} \tilde{L}\,\psi(x, \tilde{L})\, d\tilde{L} , \tag{5.2}$$

where

$$\Delta\psi(x, L) = \int_{L}^{L+\Delta L} \psi(x, \tilde{L})\, d\tilde{L} \tag{5.3}$$


is called the eigendifferential of the operator $\hat{L}$, introduced (as mentioned earlier)
by the great mathematician H. Weyl. The eigendifferential is a special wave
group which has only a finite extension in space (in the $x$ domain), similar to the
previously studied wave groups; hence, it vanishes at infinity and therefore can
be seen in analogy to bound states. Now we show that, although the functions
$\psi(x, L)$ themselves cannot be normalized in the usual sense, the eigendifferentials
$\Delta\psi(x, L)$ are orthogonal. Furthermore, because the $\Delta\psi(x, L)$ have finite spatial
extension, they can be normalized. Then in the limit $\Delta L \to 0$, a meaningful
normalization of the functions $\psi(x, L)$ themselves follows: the normalization on
$\delta$-functions.

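To begin with, we form the complex conjugate expressions of (5.1) and
(5.2), i.e.

$$\hat{L}^*\,\psi^*(x, L') = L'\,\psi^*(x, L') , \tag{5.4}$$

$$\hat{L}^*\,\Delta\psi^*(x, L') = \int_{L'}^{L'+\Delta L'} \tilde{L}'\,\psi^*(x, \tilde{L}')\, d\tilde{L}' , \tag{5.5}$$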
where we have renamed the continuous eigenvalue $L'$. Multiplication of (5.2)
by $\Delta\psi^*(x, L')$, and of (5.5) by $\Delta\psi(x, L)$, and subsequent subtraction yields

$$\int dx\, \left[\Delta\psi^*(x, L')\,\hat{L}\,\Delta\psi(x, L) - \Delta\psi(x, L)\,\hat{L}^*\,\Delta\psi^*(x, L')\right] = \int dx \int_{L}^{L+\Delta L} d\tilde{L} \int_{L'}^{L'+\Delta L'} d\tilde{L}'\, (\tilde{L} - \tilde{L}')\,\psi^*(x, \tilde{L}')\,\psi(x, \tilde{L}) .$$

Fig. 5.1a,b. Nonoverlapping (a) and overlapping (b) intervals in the eigenvalue spectrum

Since $\hat{L}$ is Hermitian, the l.h.s. vanishes. As the intervals $\Delta L$ and $\Delta L'$ should be
small, we can place $(L - L')$ outside of the triple integral, by using the
mean-value theorem from integral calculus, and obtain

$$(L - L') \int dx\, \Delta\psi^*(x, L')\,\Delta\psi(x, L) = 0 . \tag{5.6}$$

In the case where the intervals $\Delta L$ and $\Delta L'$ do not overlap (see Fig. 5.1a), $L \neq L'$
holds, and from (5.6) the orthogonality of the eigendifferentials follows:

$$\int dx\, \Delta\psi^*(x, L')\,\Delta\psi(x, L) = 0 , \quad \text{for } L \neq L' . \tag{5.7}$$


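The situation is different when the intervals $\Delta L$ and $\Delta L'$ overlap (see Fig. 5.1b).
First we show that the integral

$$N = \int dx\, \Delta\psi^*(x, L)\,\Delta\psi(x, L) \tag{5.8}$$

is small, of the order of $\Delta L$. We can see this by writing for (5.8):

$$N = \int dx\, \Delta\psi^*(x, L)\,\Delta\psi(x, L) = \int dx\, \Delta\psi^*(x, L) \int_{L}^{L+\Delta L} \psi(x, \tilde{L})\, d\tilde{L} = \int dx\, \Delta\psi^*(x, L) \int_{L_1}^{L_2} \psi(x, \tilde{L})\, d\tilde{L} , \tag{5.9}$$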


where $L_1$ and $L_2$ are chosen in such a way that the interval $(L, L + \Delta L)$ is
located inside of the interval $(L_1, L_2)$ (see Fig. 5.1). As a result of the
orthogonality (5.7) of the eigendifferentials, the contribution of the intervals $(L_1, L)$
and $(L + \Delta L, L_2)$ to the total integral vanishes in the last step of (5.9). If we let
$\Delta L$ tend to zero, it will become apparent that $N$ tends to zero like $\Delta\psi^*(x, L)$;
hence it is proportional to $\Delta L$. Therefore we can always achieve, by an
appropriate normalization of $\Delta\psi(x, L)$, that

$$\frac{N}{\Delta L} = 1 , \tag{5.10}$$

i.e.

$$\int dx\, \Delta\psi^*(x, L)\,\Delta\psi(x, L) = \Delta L \quad \text{for } \Delta L \to 0 .$$


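We now combine results (5.7) and (5.10) in the following orthogonality
condition for eigendifferentials:

$$\int dx\, \Delta\psi^*(x, L')\,\Delta\psi(x, L) = \begin{cases} \Delta L & \text{for overlapping intervals } (L, L + \Delta L) \text{ and } (L', L' + \Delta L') \\ 0 & \text{for nonoverlapping intervals } (L, L + \Delta L) \text{ and } (L', L' + \Delta L') \end{cases}$$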
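The orthogonality condition for eigendifferentials can be illustrated numerically. The following sketch assumes, purely for illustration, the continuum functions $\psi(x, k) = e^{\mathrm{i}kx}/\sqrt{2\pi}$ with the continuous eigenvalue $L = k$, for which the eigendifferential (5.3) can be integrated in closed form. The helper names, the integration range and the step size are our own choices.

```python
import cmath
import math

def eigendifferential(x, k, dk):
    """Integral of exp(i k' x)/sqrt(2 pi) over k' from k to k + dk, cf. (5.3)."""
    if abs(x) < 1e-12:
        return complex(dk / math.sqrt(2.0 * math.pi))
    return ((cmath.exp(1j * (k + dk) * x) - cmath.exp(1j * k * x))
            / (1j * x * math.sqrt(2.0 * math.pi)))

def overlap(k1, k2, dk, x_max=200.0, steps=80000):
    """Midpoint-rule estimate of the x-integral of conj(dpsi(x, k1)) * dpsi(x, k2)."""
    h = 2.0 * x_max / steps
    total = 0.0 + 0.0j
    for i in range(steps):
        x = -x_max + (i + 0.5) * h
        total += eigendifferential(x, k1, dk).conjugate() * eigendifferential(x, k2, dk)
    return total * h

if __name__ == "__main__":
    print(overlap(1.0, 1.0, 0.5))  # identical intervals: close to dk
    print(overlap(1.0, 2.0, 0.5))  # nonoverlapping intervals: close to zero
```

Because the integrand only falls off like $1/x^2$, the finite range of integration limits the accuracy to a few parts in a thousand; this is a property of the sketch, not of the statement being checked.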
This allows a further transformation for small $\Delta L$, namely:

$$\int dx\, \Delta\psi^*(x, L')\,\Delta\psi(x, L) = \int dx\, \Delta\psi^*(x, L') \int_{L}^{L+\Delta L} \psi(x, \tilde{L})\, d\tilde{L} = \int dx\, \Delta\psi^*(x, L')\,\psi(x, L)\,\Delta L = \begin{cases} \Delta L & \text{for overlapping intervals} \\ 0 & \text{for nonoverlapping intervals} \end{cases} \tag{5.11}$$


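After division by $\Delta L$ this becomes

$$\int dx\, \Delta\psi^*(x, L')\,\psi(x, L) = \begin{cases} 1 & \text{if the point } L \text{ lies within the interval } (L', L' + \Delta L') \\ 0 & \text{if } L \text{ does not lie within the interval } (L', L' + \Delta L') \end{cases}$$

This can also be written as

$$\int_{L'}^{L'+\Delta L'} d\tilde{L}' \int dx\, \psi^*(x, \tilde{L}')\,\psi(x, L) = \begin{cases} 1 & \text{if } L = L' \text{ in the limit } \Delta L' \to 0 \\ 0 & \text{if } L \neq L' \text{ in the limit } \Delta L' \to 0 \end{cases} \tag{5.12}$$

The expression

$$\int dx\, \psi^*(x, L')\,\psi(x, L) = \delta(L - L') \tag{5.13}$$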


obviously must be Dirac's delta function, already familiar to us from
electrodynamics, for which, according to (5.12),

$$\int_{L'}^{L'+\Delta L'} \delta(L - \tilde{L}')\, d\tilde{L}' = \begin{cases} 1 & \text{if } L \text{ lies within the interval } \Delta L' \\ 0 & \text{if } L \text{ does not lie within the interval } \Delta L' \end{cases} \tag{5.14}$$

holds. From this relation, we immediately get the familiar property

$$\int_{a}^{b} f(L')\,\delta(L - L')\, dL' = \begin{cases} f(L) & \text{if } L \text{ is located in } (a, b) \\ 0 & \text{if } L \text{ is not located in } (a, b) \end{cases} \tag{5.15}$$

for, according to (5.14), $\delta(L - L')$ has to be localized as a function of $L'$ around
the value $L$, so that the integral over any interval $\Delta L'$ containing $L$ always yields
the value one (see Fig. 5.2).

Normalization (5.13) for the functions $\psi(x, L)$ of the continuous spectrum
has grown out of normalization (5.11) for the eigendifferentials. Instead of
speaking of the orthonormalization of eigendifferentials, we can say according
to (5.13): the functions $\psi(x, L)$ of the continuous spectrum are normalized on
$\delta$-functions. Hence this normalization on $\delta$-functions in the continuous spectrum
corresponds exactly to the normalization on Kronecker's $\delta_{nm}$ in the discrete
spectrum.

Fig. 5.2. Two functions which approximate the $\delta(L - L')$-function

5.2 Expansion into Eigenfunctions


We make the mathematical assumption that all eigenfunctions of an operator $\hat{L}$,
which we call $\psi_n(x)$ and which belong to the eigenvalues $L_n$, constitute a
complete set of functions. By this we mean that each arbitrary function $\psi(x)$ can be
expanded in terms of these eigenfunctions $\psi_n(x)$:

$$\psi(x) = \sum_n a_n \psi_n(x) . \tag{5.16}$$

We can easily determine the $a_n$ because of the orthogonality of the $\psi_n$:
$\langle \psi_n | \psi_m \rangle = \delta_{nm}$. Multiplying (5.16) on both sides by $\psi_m^*(x)$ and integrating over
$x$ yields

$$\int \psi_m^*(x)\,\psi(x)\, dx = \sum_n a_n \int \psi_m^*(x)\,\psi_n(x)\, dx = \sum_n a_n \delta_{nm} = a_m . \tag{5.17}$$
We should notice the analogy between expansion (5.16) and the expansion of
a vector $\boldsymbol{A} = \sum_i a_i \boldsymbol{e}_i$ in an orthonormal vector basis $\boldsymbol{e}_i$. Therefore the expansion
coefficients $a_n$ in (5.16) can also be interpreted as components of the vector (state)
$\psi$ in the basis $\psi_n$. If we insert (5.17) into (5.16), we obtain

$$\psi(x) = \sum_n \left( \int \psi_n^*(x')\,\psi(x')\, dx' \right) \psi_n(x) = \int \left( \sum_n \psi_n^*(x')\,\psi_n(x) \right) \psi(x')\, dx' . \tag{5.18}$$
In order for this identity to hold for each arbitrary function $\psi(x)$, obviously

$$\sum_n \psi_n^*(x')\,\psi_n(x) = \delta(x - x') \tag{5.19}$$

must be valid. This is the so-called closure relation, which also follows directly by
expansion of the $\delta$-function in the $\psi_n(x)$:

$$\delta(x - x') = \sum_n a_n \psi_n(x) , \qquad a_n = \int \psi_n^*(x)\,\delta(x - x')\, dx = \psi_n^*(x') ,$$

and therefore

$$\delta(x - x') = \sum_n \psi_n^*(x')\,\psi_n(x) .$$

If a function as singular as the $\delta(x - x')$-function can be expanded in terms of
the set $\psi_n(x)$, it must be complete (or closed); hence the name "closure relation".
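The expansion (5.16) and the coefficient formula (5.17) can be tried out on a concrete complete set. The sketch below assumes, as an illustration only, the orthonormal functions $\psi_n(x) = \sqrt{2}\sin(n\pi x)$ on the interval $0 \le x \le 1$ and the test function $f(x) = x(1 - x)$; the integrals are evaluated with a simple midpoint rule, and all function names are our own.

```python
import math

def psi(n, x):
    """Assumed orthonormal basis on [0, 1]: sqrt(2) sin(n pi x)."""
    return math.sqrt(2.0) * math.sin(n * math.pi * x)

def integrate(g, a=0.0, b=1.0, steps=2000):
    """Midpoint rule for the integral of g over [a, b]."""
    h = (b - a) / steps
    return h * sum(g(a + (i + 0.5) * h) for i in range(steps))

def coefficients(f, n_max):
    """Expansion coefficients a_n = integral of psi_n(x) f(x) dx, cf. (5.17)."""
    return [integrate(lambda x, n=n: psi(n, x) * f(x)) for n in range(1, n_max + 1)]

def reconstruct(coeffs, x):
    """Partial sum of the expansion (5.16)."""
    return sum(a * psi(n, x) for n, a in enumerate(coeffs, start=1))

def f(x):
    return x * (1.0 - x)

if __name__ == "__main__":
    a = coefficients(f, 15)
    print(f(0.3), reconstruct(a, 0.3))
```

For this test function the even coefficients vanish by symmetry, and the odd ones fall off like $1/n^3$, so a handful of terms already reproduces $f(x)$ closely.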


EXAMPLE


5.1 Normalization of the Eigenfunctions of the Momentum Operator $\hat{p}_x$


We know the eigenfunctions of the momentum operator

$$\psi_{p_x}(x) = C_{p_x} \exp\left( \mathrm{i}\, \frac{p_x x}{\hbar} \right) ,$$

where $C_{p_x}$ is a normalization constant, which here has to be determined and
can, in principle, depend on $p_x$. $p_x$ is the eigenvalue of momentum with
the continuous spectrum $-\infty < p_x < \infty$. In order to determine $C_{p_x}$, we form,
according to (5.13), the integral


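$$\int_{-\infty}^{\infty} \psi_{p'_x}^*(x)\,\psi_{p_x}(x)\, dx = C_{p'_x}^* C_{p_x} \int_{-\infty}^{\infty} \exp\left( \mathrm{i}\, \frac{(p_x - p'_x)\,x}{\hbar} \right) dx$$

$$= C_{p'_x}^* C_{p_x}\, \hbar \lim_{n \to \infty} \int_{-n}^{n} \exp\left[ \mathrm{i}\,(p_x - p'_x)\,\xi \right] d\xi \qquad (\xi = x/\hbar)$$

$$= C_{p'_x}^* C_{p_x}\, \hbar \lim_{n \to \infty} \frac{2 \sin[(p_x - p'_x)\,n]}{p_x - p'_x} .$$

In Example 5.2 we will show that

$$\lim_{n \to \infty} \frac{\sin(nx)}{\pi x} = \delta(x)$$

is valid. Therefore the above expression can now be written as follows:

$$\int_{-\infty}^{\infty} \psi_{p'_x}^*(x)\,\psi_{p_x}(x)\, dx = C_{p'_x}^* C_{p_x}\, 2\pi\hbar\, \delta(p_x - p'_x) .$$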


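In order to achieve normalization (5.13) on $\delta$-functions,

$$|C_{p_x}|^2\, 2\pi\hbar = 1 , \quad \text{i.e.} \quad C_{p_x} = \frac{1}{\sqrt{2\pi\hbar}}$$

must hold. A possible phase factor $e^{\mathrm{i}\varphi(p_x)}$, which does not affect anything, was set
equal to one in the last step. Consequently, the orthonormalized momentum
eigenfunctions are

$$\psi_{p_x}(x) = \frac{1}{\sqrt{2\pi\hbar}} \exp\left( \mathrm{i}\, \frac{p_x x}{\hbar} \right) .$$

The three-dimensional generalization of this obviously reads

$$\psi_{\boldsymbol{p}}(\boldsymbol{r}) = \psi_{p_x}(x)\,\psi_{p_y}(y)\,\psi_{p_z}(z) = \frac{1}{(2\pi\hbar)^{3/2}} \exp\left( \mathrm{i}\, \frac{p_x x + p_y y + p_z z}{\hbar} \right) = \frac{1}{(2\pi\hbar)^{3/2}} \exp\left( \mathrm{i}\, \frac{\boldsymbol{p} \cdot \boldsymbol{r}}{\hbar} \right) .$$

These are, as is immediately clear, normalized as

$$\int \psi_{\boldsymbol{p}'}^*(\boldsymbol{r})\,\psi_{\boldsymbol{p}}(\boldsymbol{r})\, d^3r = \int \psi_{p'_x}^*(x)\,\psi_{p_x}(x)\, dx \times \int \psi_{p'_y}^*(y)\,\psi_{p_y}(y)\, dy \int \psi_{p'_z}^*(z)\,\psi_{p_z}(z)\, dz$$

$$= \delta(p'_x - p_x)\,\delta(p'_y - p_y)\,\delta(p'_z - p_z) = \delta(\boldsymbol{p}' - \boldsymbol{p}) .$$

The last step comprises the definition of the three-dimensional $\delta$-function.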
Two approximation functions $\varphi_n(x)$, with $\int_{-\infty}^{\infty} \varphi_n(x)\, dx = 1$, which tend to the
$\delta(x)$-function for $n \to \infty$

The functions $\varphi_{n=10}(x)$ and $\varphi_{n=20}(x)$


EXAMPLE


5.2 A Representation of the $\delta$-Function


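In accordance with the definition

$$\delta(x) = 0 \quad \text{for } x \neq 0 , \quad \text{and}$$

$$\int_{-\infty}^{\infty} \delta(x)\, dx = \int_{-\varepsilon}^{\varepsilon} \delta(x)\, dx = 1 \quad (\varepsilon > 0) ,$$

as well as

$$\int_{-\infty}^{\infty} f(x)\,\delta(x)\, dx = f(0) ,$$

the $\delta$-function must be an extremely singular function*. As already frequently
mentioned, we can visualize that $\delta(x)$ equals zero everywhere except at $x = 0$,
where it takes on such large values that the area between $\delta(x)$ and the $x$-axis
yields exactly the value one (see adjacent figure).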

The $\delta$-function can be represented in a more formal way by a set of analytic
functions $\varphi_n(x)$, so that

$$\delta(x) = \lim_{n \to \infty} \varphi_n(x) .$$

These functions $\varphi_n(x)$ must have the property (see next figure) of constantly
increasing at $x = 0$ for large values of $n$ and decreasing continually for $x \neq 0$,
so that

$$\int_{-\infty}^{\infty} \varphi_n(x)\, dx = 1$$

remains valid for every $n$. There are many sets of functions which satisfy these
conditions. In this case we speak of various representations of the $\delta$-function. An
especially advantageous representation is given by the functions

$$\varphi_n(x) = \frac{\sin nx}{\pi x} , \tag{1}$$

where $n$ is a positive number ($n \in \mathbb{N}$). Obviously

$$\varphi_n(0) = \frac{n}{\pi} . \tag{2}$$


* A stringent mathematical foundation of the 6-function was given by Laurent Schwartz. 
We draw special attention to the elegant paper by C. Schmieden, D. Laugwitz: 
Mathematische Zeitschrift 69, 1-39 (1958). 
In addition, the $\varphi_n(x)$ oscillate with the period $2\pi/n$ and have decreasing
amplitudes for $|x| \to \infty$. Moreover, for all $\varphi_n$,

$$\int_{-\infty}^{\infty} \varphi_n(x)\, dx = \int_{-\infty}^{\infty} \frac{\sin nx}{\pi x}\, dx = 1 \tag{3}$$

holds. This is obvious for the limit $n \to \infty$. Then $\varphi_n(0) = n/\pi$ tends to $\infty$, and the
oscillation period $n\Delta x = 2\pi$, i.e. $\Delta x = 2\pi/n$, tends to zero, so that the
contribution of the rapidly oscillating function to the integral vanishes in the domain
$x \neq 0$. Then only the contribution from the surroundings of the point $x = 0$
remains (see figure). Estimating the central peak as a triangle of height $\varphi_n(0)$ and
base $\Delta x$, this yields

$$\frac{1}{2}\,\varphi_n(0)\,\Delta x = \frac{1}{2}\,\frac{n}{\pi}\,\frac{2\pi}{n} = 1 .$$

Hence functions (1) have all properties of the $\delta$-function in the limit $n \to \infty$, and
we can write

$$\delta(x) = \lim_{n \to \infty} \frac{\sin nx}{\pi x} .$$

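We list (without proof) some other representations of the $\delta$-function:

$$\text{a)} \quad \delta(x - x_0) = \frac{1}{\pi} \lim_{t \to \infty} \frac{1 - \cos t(x - x_0)}{t\,(x - x_0)^2}$$

$$\text{b)} \quad \delta(x - x_0) = \frac{1}{\pi} \lim_{\varepsilon \to 0} \frac{\varepsilon}{(x - x_0)^2 + \varepsilon^2}$$

c) Let $\Theta(x)$ be Heaviside's step function (see figure)

$$\Theta(x) = \begin{cases} 1 & \text{for } x > 0 \\ 0 & \text{for } x < 0 \end{cases}$$

Heaviside's step function

Then the relation

$$\delta(x - x_0) = \lim_{\varepsilon \to 0} \frac{\Theta(x - x_0 + \varepsilon) - \Theta(x - x_0)}{\varepsilon} = \frac{d\Theta(x - x_0)}{dx}$$

holds. This means: the distribution $\delta$ is the derivative of the Heaviside
distribution. The integral relation

$$\Theta(x - x_0) = \int_{-\infty}^{x} \delta(x' - x_0)\, dx' = \begin{cases} 1 & \text{for } x > x_0 \\ 0 & \text{for } x < x_0 \end{cases}$$

is immediately clear.

$$\text{d)} \quad \delta(x - x_0) = \lim_{a \to 0} \frac{1}{a\sqrt{\pi}} \exp\left[ -\frac{(x - x_0)^2}{a^2} \right]$$

$$\text{e)} \quad \delta(x - x_0) = \lim_{b \to 0} \frac{1}{\pi b}\, \frac{1}{1 + (x - x_0)^2/b^2}$$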
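EXAMPLE


5.3 Cauchy's Principal Value


A further, very useful, relation follows from the identity

$$\frac{1}{x \pm \mathrm{i}\varepsilon} = \frac{x \mp \mathrm{i}\varepsilon}{x^2 + \varepsilon^2} = \frac{x}{x^2 + \varepsilon^2} \mp \mathrm{i}\, \frac{\varepsilon}{x^2 + \varepsilon^2} . \tag{1}$$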


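If we inspect integrals of the form

$$\lim_{\varepsilon \to 0} \int_{-\infty}^{\infty} \frac{f(x)}{x \pm \mathrm{i}\varepsilon}\, dx = \lim_{\varepsilon \to 0} \int_{-\infty}^{\infty} \frac{x f(x)}{x^2 + \varepsilon^2}\, dx \mp \mathrm{i} \lim_{\varepsilon \to 0} \int_{-\infty}^{\infty} \frac{\varepsilon f(x)}{x^2 + \varepsilon^2}\, dx , \tag{2}$$

the last term on the r.h.s. yields, according to relation b) stated in Example 5.2,

$$\mp \mathrm{i} \lim_{\varepsilon \to 0} \int_{-\infty}^{\infty} f(x)\, \frac{\varepsilon}{x^2 + \varepsilon^2}\, dx = \mp \mathrm{i}\pi \int_{-\infty}^{\infty} f(x)\,\delta(x)\, dx = \mp \mathrm{i}\pi f(0) . \tag{3}$$

The first term in (2) can also be rewritten as

$$\lim_{\varepsilon \to 0} \int_{-\infty}^{\infty} \frac{x f(x)}{x^2 + \varepsilon^2}\, dx = \lim_{\varepsilon \to 0} \left[ \int_{-\infty}^{-\varepsilon} \frac{f(x)}{x}\, dx + \int_{\varepsilon}^{\infty} \frac{f(x)}{x}\, dx + \int_{-\varepsilon}^{\varepsilon} \frac{x f(x)}{x^2 + \varepsilon^2}\, dx \right]$$

$$= P \int_{-\infty}^{\infty} \frac{f(x)}{x}\, dx + f(0) \lim_{\varepsilon \to 0} \int_{-\varepsilon}^{\varepsilon} \frac{x}{x^2 + \varepsilon^2}\, dx . \tag{4}$$

Here, $P$ designates Cauchy's principal value,

$$P \int_{-\infty}^{\infty} \frac{f(x)}{x}\, dx = \lim_{\varepsilon \to 0} \left[ \int_{-\infty}^{-\varepsilon} \frac{f(x)}{x}\, dx + \int_{\varepsilon}^{\infty} \frac{f(x)}{x}\, dx \right] . \tag{5}$$

The second term in (4) vanishes because the integrand is an odd function even in
the limit $\varepsilon \to 0$. Therefore we now can write (2) as:

$$\lim_{\varepsilon \to 0} \int_{-\infty}^{\infty} \frac{f(x)}{x \pm \mathrm{i}\varepsilon}\, dx = P \int_{-\infty}^{\infty} \frac{f(x)}{x}\, dx \mp \mathrm{i}\pi f(0) . \tag{6}$$

This can be summarized symbolically in the simple, often-used formula

$$\lim_{\varepsilon \to 0} \frac{1}{x \pm \mathrm{i}\varepsilon} = P\, \frac{1}{x} \mp \mathrm{i}\pi\,\delta(x) . \tag{7}$$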
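Relation (6) lends itself to a direct numerical check. The sketch below assumes the test function $f(x) = (1 + x)\,e^{-x^2}$ and the upper sign, $x + \mathrm{i}\varepsilon$. For this choice the principal-value integral equals $\int e^{-x^2} dx = \sqrt{\pi}$, because the part $e^{-x^2}/x$ is odd, and $f(0) = 1$, so the limit should be $\sqrt{\pi} - \mathrm{i}\pi$. Function name, integration range and step size are our own choices.

```python
import math

def integral_with_shift(eps, x_max=8.0, steps=160000):
    """Midpoint-rule value of the integral of f(x) / (x + i eps),
    for the assumed test function f(x) = (1 + x) exp(-x^2)."""
    h = 2.0 * x_max / steps
    total = 0.0 + 0.0j
    for i in range(steps):
        x = -x_max + (i + 0.5) * h
        total += (1.0 + x) * math.exp(-x * x) / complex(x, eps)
    return total * h

if __name__ == "__main__":
    for eps in (1e-1, 1e-2, 1e-3):
        print(eps, integral_with_shift(eps))
    print("expected limit:", complex(math.sqrt(math.pi), -math.pi))
```

The step size has to stay well below $\varepsilon$, otherwise the narrow peak of the integrand near $x = 0$ is not resolved; the deviation from the limit shrinks linearly with $\varepsilon$.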
5.4 The $\delta$-Function as the Limit of Bell-Shaped Curves


Problem. Show that the $\delta$-function can be represented as the limit of the
"bell-shaped curves",

$$y(x, \varepsilon) = \frac{1}{\pi}\, \frac{\varepsilon}{x^2 + \varepsilon^2} \quad (\varepsilon > 0) .$$

Solution. The bell-shaped curves become narrower and higher with decreasing
$\varepsilon$ (see figure with $\varepsilon_1 < \varepsilon_2$). We have

$$\lim_{\varepsilon \to 0} y(x, \varepsilon) = \begin{cases} \infty & \text{for } x = 0 \\ 0 & \text{for } x \neq 0 \end{cases}$$

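but the areas below the curves always have the value

$$\int_{-\infty}^{+\infty} y(x, \varepsilon)\, dx = \frac{1}{\pi} \arctan\frac{x}{\varepsilon}\, \Bigg|_{-\infty}^{+\infty} = 1 ,$$

independently of $\varepsilon$. Now we examine the integral

$$F(\varepsilon) = \int_{-\infty}^{+\infty} f(x)\, y(x, \varepsilon)\, dx$$

for a continuous, bounded function $f(x)$ as function of the parameter $\varepsilon$. We
substitute $x = \varepsilon\xi$ and obtain

$$F(\varepsilon) = \int_{-\infty}^{+\infty} f(\varepsilon\xi)\, g(\xi)\, d\xi , \quad \text{with} \quad g(\xi) = \frac{1}{\pi}\, \frac{1}{1 + \xi^2} \quad \text{and} \quad \int_{-\infty}^{+\infty} g(\xi)\, d\xi = 1 .$$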
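This integral now converges uniformly according to our assumptions, for an
$M \in \mathbb{R}$ independent of $\varepsilon$ exists with $|f(\varepsilon\xi)\,g(\xi)| < M g(\xi)$. Now we have a
theorem at hand which then guarantees that $F(\varepsilon)$ is continuous. Hence we have
$\lim_{\varepsilon \to 0} F(\varepsilon) = F(0)$ and thus

$$\lim_{\varepsilon \to 0} \int_{-\infty}^{+\infty} f(x)\, y(x, \varepsilon)\, dx = \lim_{\varepsilon \to 0} F(\varepsilon) = F(0) = f(0) \int_{-\infty}^{+\infty} g(\xi)\, d\xi = f(0) = \int_{-\infty}^{+\infty} f(x)\,\delta(x)\, dx$$

for arbitrary continuous bounded functions $f$. Consequently we can write

$$\delta(x) = \text{``}\lim_{\varepsilon \to 0} y(x, \varepsilon)\text{''} .$$

The quotation marks should remind us that the limit $\varepsilon \to 0$ may not be
performed before the integration over a test function.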
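The convergence $F(\varepsilon) \to f(0)$ can be followed numerically. The sketch below assumes the bounded test function $f(x) = 1/(1 + x^2)$, for which the integral is known exactly, $F(\varepsilon) = 1/(1 + \varepsilon)$, and it maps the real axis onto a finite interval by the substitution $x = \tan u$. The function names and the number of grid points are our own choices.

```python
import math

def bell(x, eps):
    """Bell-shaped curve y(x, eps) = eps / (pi (x^2 + eps^2))."""
    return eps / (math.pi * (x * x + eps * eps))

def smeared(f, eps, steps=100000):
    """Integral of f(x) y(x, eps) over the real axis.

    The substitution x = tan(u) maps the axis onto (-pi/2, pi/2),
    with dx = du / cos(u)^2; the midpoint rule avoids the end points.
    """
    h = math.pi / steps
    total = 0.0
    for i in range(steps):
        u = -0.5 * math.pi + (i + 0.5) * h
        x = math.tan(u)
        total += f(x) * bell(x, eps) / math.cos(u) ** 2
    return total * h

def f(x):
    """Assumed bounded, continuous test function with f(0) = 1."""
    return 1.0 / (1.0 + x * x)

if __name__ == "__main__":
    for eps in (1.0, 0.1, 0.01):
        print(eps, smeared(f, eps), 1.0 / (1.0 + eps))
```

The limit is only reached after the integration has been carried out, in line with the remark on the quotation marks above.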


Two bell-shaped curves
6. The Schrödinger Equation


In classical mechanics it is possible to calculate, for example, the vibrational
modes of a string, membrane or resonator by solving a wave equation, subject to
certain boundary conditions. At the very beginning of the development of
quantum mechanics, one was faced with the problem of finding a differential
equation describing discrete states of an atom. It was not possible to deduce
such an equation exactly from old and well-known physical principles; instead,
one had to search for parallels in classical mechanics and try to deduce the
desired equation on the basis of plausible arguments. Such an equation, not
derived but guessed at intuitively, would then be a postulate of the new theory,
and its validity would have to be checked by experiment. This equation for the
calculation of quantum-mechanical states is called the Schrödinger equation; let
us now "derive" it.

In relativistic classical mechanics, time coordinates and spatial coordinates
as well as energy and momentum are respectively treated as the four components
of a four-vector, i.e.

$$\{x_\nu\} = (\boldsymbol{r}, \mathrm{i}ct) , \quad \{p_\nu\} = \left( \boldsymbol{p}, \mathrm{i}\, \frac{E}{c} \right) . \tag{6.1}$$

By enlarging the operator representation of the three-dimensional momentum
to a four-dimensional, relativistic covariant vector-operator we get

$$\left( \hat{\boldsymbol{p}}, \mathrm{i}\, \frac{\hat{E}}{c} \right) = -\mathrm{i}\hbar \left( \frac{\partial}{\partial x}, \frac{\partial}{\partial y}, \frac{\partial}{\partial z}, \frac{\partial}{\partial(\mathrm{i}ct)} \right) . \tag{6.2}$$

Both sides of this equation are four-vectors. By comparison, the energy is
replaced by the following operator:

$$\hat{E} = \mathrm{i}\hbar\, \frac{\partial}{\partial t} . \tag{6.3}$$

We remember here that we already had an operator for the energy in (4.83),
namely the Hamiltonian $\hat{H}$ of a particle. Obviously we have two operators for
the energy. Both $\hat{E}$ and the Hamiltonian $\hat{H}$ describe the total energy and can
therefore be set equal. This generates the Schrödinger equation.


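$$\hat{E}\,\psi(\boldsymbol{r}, t) = \hat{H}\,\psi(\boldsymbol{r}, t) \quad \text{or} \quad \mathrm{i}\hbar\, \frac{\partial}{\partial t}\, \psi(\boldsymbol{r}, t) = \hat{H}\,\psi(\boldsymbol{r}, t) , \quad \text{with} \quad \hat{H} = -\frac{\hbar^2}{2m}\, \Delta + V(\boldsymbol{r}) . \tag{6.4}$$

Using the wave function of a free particle (de Broglie wave),

$$\psi(\boldsymbol{r}, t) = A \exp\left[ -\frac{\mathrm{i}}{\hbar} (Et - \boldsymbol{p} \cdot \boldsymbol{r}) \right] = A \exp\left[ \frac{\mathrm{i}}{\hbar} (\boldsymbol{p} \cdot \boldsymbol{r} - Et) \right] = A \exp\left( \frac{\mathrm{i}}{\hbar} \sum_\nu p_\nu x_\nu \right) , \tag{6.5}$$

we find that the operator $\hat{E}$ has the total energy $E$ as an eigenvalue.
The Schrödinger equation (6.4) is not a relativistic equation. Indeed, starting
from

$$E^2 = p^2 c^2 + m^2 c^4 \tag{6.6}$$

for the energy of a free relativistic particle, the free Klein-Gordon equation
follows, i.e.

$$-\hbar^2\, \frac{\partial^2}{\partial t^2}\, \psi(\boldsymbol{r}, t) = \left( -\hbar^2 c^2 \Delta + m^2 c^4 \right) \psi(\boldsymbol{r}, t) . \tag{6.7}$$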


The Schrödinger equation and the Klein-Gordon equation are linear differential
equations; this means that with $\psi_1$ and $\psi_2$ the function defined by
$\psi = a\psi_1 + b\psi_2$ is a solution, too. This is the mathematical formulation for the
principle of superposition, which was discussed in Chap. 3. Schrödinger's equation
is of first order in time and second order in space; the Klein-Gordon
equation is of second order in both space and time. We suppose that the wave
function at time $t_0$ contains all the information about how the state propagates if
there are no external perturbations. Only Schrödinger's equation as a first-order
differential equation in time satisfies this requirement. The Klein-Gordon equation,
being important in relativistic quantum mechanics, needs to be reinterpreted.
The Schrödinger equation (6.4) contains the imaginary unit i as a factor,
which implies that oscillating solutions are possible.

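It is separable into time and space, if the Hamiltonian $\hat{H} = \hat{H}(\boldsymbol{r}, \hat{\boldsymbol{p}})$ is not
explicitly time dependent:

$$\psi(\boldsymbol{r}, t) = \psi(\boldsymbol{r})\, f(t)$$

and therefore

$$\mathrm{i}\hbar\, \psi(\boldsymbol{r})\, \frac{\partial f(t)}{\partial t} = [\hat{H}\psi(\boldsymbol{r})]\, f(t) . \tag{6.8}$$

[Since $\psi(\boldsymbol{r}, t)$ and $\psi(\boldsymbol{r})$ are two different functions, this should not lead to any
misunderstanding.] After separating the variables, one finds the equation

$$\mathrm{i}\hbar\, \frac{\dot{f}(t)}{f(t)} = \frac{\hat{H}\psi(\boldsymbol{r})}{\psi(\boldsymbol{r})} = \text{const} = E . \tag{6.9}$$

This means for the time-dependent function

$$f(t) = \text{const} \cdot \exp\left( -\mathrm{i}\, \frac{Et}{\hbar} \right) . \tag{6.10}$$

The function with the spatial argument $\psi(\boldsymbol{r})$ solves the stationary Schrödinger
equation

$$\hat{H}\psi(\boldsymbol{r}) = E\psi(\boldsymbol{r}) . \tag{6.11}$$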




The wave function $\psi(\boldsymbol{r}, t)$ is periodic in time, with the phase factor
$\exp[-\mathrm{i}(Et/\hbar)]$, and that is why the densities $\psi^*\psi$ and also, as we shall see, the
currents are time independent. Equation (6.11) is an eigenvalue equation of the
Hamiltonian, with $E$ being the real energy eigenvalue. The general stationary
solutions of (6.4) are oscillating functions in time,

$$\psi_n(\boldsymbol{r}, t) = \psi_n(\boldsymbol{r}) \exp\left( -\mathrm{i}\, \frac{E_n t}{\hbar} \right) , \tag{6.12}$$

with the normalization

$$\int \psi_n^*(\boldsymbol{r}, t)\,\psi_n(\boldsymbol{r}, t)\, dV = \int \psi_n^*(\boldsymbol{r})\,\psi_n(\boldsymbol{r})\, dV = 1 . \tag{6.13}$$

Any stationary state corresponds to well-defined energy and to an infinite
stability in time. It has the character of a standing wave because the density of
probability given by $\psi^*\psi$ is time independent. This is not true for a linear
superposition of stationary states.
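The difference between a stationary state and a superposition can be made concrete with a few lines of code. The sketch below assumes, for illustration only, the one-dimensional analogue of a particle in a box with infinitely high walls, $\psi_n(x) = \sqrt{2/a}\,\sin(n\pi x/a)$ with $E_n = \hbar^2\pi^2 n^2/(2ma^2)$, in units where $\hbar = m = a = 1$. These units and all function names are our own choices.

```python
import cmath
import math

HBAR = 1.0   # assumed units: hbar = m = 1
MASS = 1.0
WIDTH = 1.0  # assumed box width a = 1

def energy(n):
    """Energy of the n-th state of the assumed one-dimensional box."""
    return (HBAR * math.pi * n / WIDTH) ** 2 / (2.0 * MASS)

def stationary(n, x, t):
    """Stationary state of the form (6.12): spatial part times exp(-i E_n t / hbar)."""
    spatial = math.sqrt(2.0 / WIDTH) * math.sin(n * math.pi * x / WIDTH)
    return spatial * cmath.exp(-1j * energy(n) * t / HBAR)

def density_stationary(n, x, t):
    return abs(stationary(n, x, t)) ** 2

def density_superposition(x, t):
    """Probability density of the equal-weight superposition of n = 1 and n = 2."""
    psi = (stationary(1, x, t) + stationary(2, x, t)) / math.sqrt(2.0)
    return abs(psi) ** 2

if __name__ == "__main__":
    for t in (0.0, 0.1, 0.2):
        print(t, density_stationary(1, 0.3, t), density_superposition(0.3, t))
```

The first density stays fixed while the second oscillates with the frequency $(E_2 - E_1)/\hbar$, because the two phase factors no longer cancel in $\psi^*\psi$.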


EXERCISE


6.1 A Particle in an Infinitely High Potential Well


Problem. A particle of mass $m$ is captured in a box limited by
$0 \le x \le a$, $0 \le y \le b$, $0 \le z \le c$.
The corresponding potential is given by

$$V = \begin{cases} 0 & \text{for } 0 < x < a ,\ 0 < y < b ,\ 0 < z < c \\ \infty & \text{elsewhere} \end{cases}$$

(See figure.)


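Solution. The Hamiltonian is given by

$$\hat{H} = -\frac{\hbar^2}{2m}\, \Delta + V(x, y, z) . \tag{1}$$

Inside the box: Here we have the potential $V = 0$ and the following stationary
Schrödinger equation:

$$-\frac{\hbar^2}{2m} \left( \frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2} \right) \psi(x, y, z) = E\,\psi(x, y, z) . \tag{2}$$

We are going to solve this problem using the well-known separation of variables
procedure:

$$\psi(x, y, z) = \psi_1(x)\,\psi_2(y)\,\psi_3(z) . \tag{3}$$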


This leads to three separate equations connected by the constants of separation
$-k_i^2$ (the constants squared are chosen to be negative; this does not conflict
with the general case, since the constants themselves are allowed to be
imaginary):

$$\frac{d^2\psi_i(x_i)}{dx_i^2} = -k_i^2\,\psi_i(x_i) , \quad i = 1, 2, 3 . \tag{4}$$

The potential along the $x$-axis

The solutions of these different equations are simply

$$\psi_i(x_i) = \text{const} \cdot \sin(k_i x_i + \delta_i) .$$

The total solution inside the potential well is therefore given by

$$\psi(x, y, z) = A \sin(k_1 x + \delta_1) \sin(k_2 y + \delta_2) \sin(k_3 z + \delta_3) , \tag{5}$$

where $A$ is a normalization factor and $\delta_i$ are phases which must be determined.


Outside the potential well: The wave function must vanish here, because $V$ is
infinitely large; otherwise there would be an infinitely large potential energy,
since $\langle \psi | V(\boldsymbol{r}) | \psi \rangle$ diverges. Since the wave functions have to be continuous, we get
two sets of boundary conditions:

$$\psi(x = 0, y, z) = \psi(x, y = 0, z) = \psi(x, y, z = 0) = 0$$

and

$$\psi(x = a, y, z) = \psi(x, y = b, z) = \psi(x, y, z = c) = 0 . \tag{6}$$


The first set requires $\delta_1 = \delta_2 = \delta_3 = 0$. The other set gives a quantization
condition:

$$k_1 a = n_1 \pi , \quad k_2 b = n_2 \pi , \quad k_3 c = n_3 \pi ,$$

and therefore

$$k_1 = \frac{n_1 \pi}{a} , \quad k_2 = \frac{n_2 \pi}{b} , \quad k_3 = \frac{n_3 \pi}{c} . \tag{7}$$

Here, $n_1, n_2, n_3 = \pm 1, \pm 2, \pm 3, \ldots$ are independent quantum numbers. The
possibility of choosing $n_i = 0$ must be excluded, because the corresponding
wave function (see the end of this exercise) would vanish everywhere. The total
energy is

$$E = \frac{\hbar^2}{2m} \left( k_1^2 + k_2^2 + k_3^2 \right) ; \tag{8}$$

it can have only discrete values, namely

$$E = \frac{\hbar^2 \pi^2}{2m} \left( \frac{n_1^2}{a^2} + \frac{n_2^2}{b^2} + \frac{n_3^2}{c^2} \right) . \tag{9}$$

This discrete energy spectrum is converted into a quasi-continuum when the
mass $m$ or the extension of the box becomes very large. The lowest energy value

$$E_{111} = \frac{\hbar^2}{2m} \left[ \left( \frac{\pi}{a} \right)^2 + \left( \frac{\pi}{b} \right)^2 + \left( \frac{\pi}{c} \right)^2 \right] \tag{10}$$

is not zero, as one would expect classically. This is the first example of
nonvanishing zero-point energy (see the extended discussion in connection with the
harmonic oscillator, Chap. 7).

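The solution inside the box

$$\psi_{n_1 n_2 n_3}(\boldsymbol{r}) = A \sin(k_1 x) \sin(k_2 y) \sin(k_3 z) \tag{11}$$

must yield a unit total probability, which means that

$$1 = \int \psi_{n_1 n_2 n_3}^*\, \psi_{n_1 n_2 n_3}\, dV = |A|^2 \int_0^a \sin^2(k_1 x)\, dx \int_0^b \sin^2(k_2 y)\, dy \int_0^c \sin^2(k_3 z)\, dz = \frac{abc\, |A|^2}{8} , \tag{12}$$

and therefore the normalization factor equals

$$|A| = \sqrt{\frac{8}{abc}} . \tag{13}$$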

The energy spectrum is shown in the next figure.

We took $a < b < c$; therefore the level $E_{211}$ is energetically higher than the
levels $E_{121}$ and $E_{112}$, and the relation $E_{211} > E_{121} > E_{112}$ holds. In the case in
which $a$, $b$, and $c$ do not differ too much, all these levels are close together. Then
we speak of a triplet (in general a multiplet) of states. For $a = b = c$, the particle
moves in a cube, and all states belonging to a triplet are degenerate in energy.
We have then

$$E_{211} = E_{121} = E_{112} . \tag{14}$$

The wave functions of the three states are

$$E_{211}: \quad \psi_{211} = \sqrt{\frac{8}{a^3}}\, \sin\frac{2\pi x}{a}\, \sin\frac{\pi y}{a}\, \sin\frac{\pi z}{a} ,$$

$$E_{121}: \quad \psi_{121} = \sqrt{\frac{8}{a^3}}\, \sin\frac{\pi x}{a}\, \sin\frac{2\pi y}{a}\, \sin\frac{\pi z}{a} ,$$

$$E_{112}: \quad \psi_{112} = \sqrt{\frac{8}{a^3}}\, \sin\frac{\pi x}{a}\, \sin\frac{\pi y}{a}\, \sin\frac{2\pi z}{a} . \tag{15}$$

If we break this degeneracy slightly, the volume is approximately that of a cube
and the three levels are close together in energy, as we just pointed out. For
states of higher energy we observe an equivalent phenomenon. For instance,
there are two triplets (because of a slight break in the cube's symmetry) close
together, namely $\psi_{221}$, $\psi_{122}$, $\psi_{212}$ and $\psi_{311}$, $\psi_{131}$, $\psi_{113}$, followed by one single
state (a singlet), namely $\psi_{222}$. Such multiplet structures of states are identified
with "shells". Shell models explaining shell structures are important in atomic
and nuclear physics. For example, in nuclear physics all nucleons in a nucleus
are supposed to be in a potential well. Of course this potential well is spherically
symmetric, but for small nuclei a boxlike potential is an acceptable approximation.
Because of the spin of the proton and the neutron (compare later Chap. 12)
and the Pauli principle (compare later Chap. 14), only two protons and two
neutrons can be put into each level. We start by filling the lowest energy levels
individually, because the system prefers the state of lowest energy. Here, the
"last" particle determines the most "visible" properties. If this last state is inside
a multiplet, then a small excitation energy suffices to lift that particle into
a higher state of energy; the nucleus is then easily excitable.

Nuclei in (a) the ground state, (b) the excited state; (×) stands for a proton,
(○) stands for a neutron

Magic nuclei in (a) the ground state, (b) the excited state. Comparison with
the diagram above shows a much higher excitation energy

If we consider a nucleus that contains just the number of protons and 
neutrons to fill a shell, then much more energy will be required to excite 
a nucleon into the first excited state. Such nuclei are particularly stable, because 
they can only be destroyed (i.e. strongly excited) if large energy gaps are 
overcome. In such a case we speak of magic nuclei (comparable with filled
electron shells in the atoms of inert gases) or double magic nuclei.¹
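The multiplet structure described in this exercise can be reproduced by simply sorting the energies (9). The sketch below uses units with $\hbar = m = 1$ and restricts itself to positive quantum numbers, since a change of sign of an $n_i$ only changes the sign of the wave function; the function names and the slightly deformed box are our own illustrative choices.

```python
import itertools
import math

def box_energy(n, sides, hbar=1.0, mass=1.0):
    """Energy (9) for quantum numbers n = (n1, n2, n3) and box sides (a, b, c)."""
    return ((hbar * math.pi) ** 2 / (2.0 * mass)
            * sum((ni / li) ** 2 for ni, li in zip(n, sides)))

def lowest_levels(sides, count, n_max=4):
    """The `count` lowest states, sorted by energy (positive quantum numbers only)."""
    states = itertools.product(range(1, n_max + 1), repeat=3)
    return sorted(states, key=lambda n: box_energy(n, sides))[:count]

if __name__ == "__main__":
    cube = (1.0, 1.0, 1.0)
    for n in lowest_levels(cube, 11):
        print(n, box_energy(n, cube))
    deformed = (0.95, 1.0, 1.05)  # assumed slightly deformed box with a < b < c
    for n in ((2, 1, 1), (1, 2, 1), (1, 1, 2)):
        print(n, box_energy(n, deformed))
```

For the cube the list shows a singlet, three triplets and the singlet $(2, 2, 2)$; for the deformed box the first triplet splits in the order $E_{211} > E_{121} > E_{112}$.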


¹ For an extensive discussion of the nuclear shell model, see J.M. Eisenberg and
W. Greiner: Nuclear Theory, Vol. 1, Nuclear Models (Collective and Single Particle
Phenomena), 3rd ed. (North-Holland, Amsterdam 1987).


EXERCISE


6.2 A Particle in a One-Dimensional Finite Potential Well


Problem. Solve the one-dimensional Schrödinger equation for a finite potential
well described by the following potential:

$$V(x) = \begin{cases} -V_0 & \text{if } |x| \le a \\ 0 & \text{if } |x| > a \end{cases}$$

Consider bound states ($E < 0$) only.


Solution. a) The wave functions for $|x| < a$ and $|x| > a$. The corresponding
Schrödinger equation is given by

$$-\frac{\hbar^2}{2m}\, \psi''(x) + V(x)\,\psi(x) = E\,\psi(x) , \quad \text{where} \quad \psi'' \equiv \frac{d^2\psi}{dx^2} . \tag{1}$$
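We define, for the sake of brevity,

$$\kappa^2 = -\frac{2mE}{\hbar^2} , \quad k^2 = \frac{2m(E + V_0)}{\hbar^2} , \tag{2}$$

and get:

$$\text{1) if } x < -a: \quad \psi_1'' - \kappa^2 \psi_1 = 0 , \quad \psi_1 = A_1 \exp(\kappa x) + B_1 \exp(-\kappa x) ; \tag{3a}$$

$$\text{2) if } -a \le x \le a: \quad \psi_2'' + k^2 \psi_2 = 0 , \quad \psi_2 = A_2 \cos(kx) + B_2 \sin(kx) ; \tag{3b}$$

$$\text{3) if } x > a: \quad \psi_3'' - \kappa^2 \psi_3 = 0 , \quad \psi_3 = A_3 \exp(\kappa x) + B_3 \exp(-\kappa x) . \tag{3c}$$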


b) Formulation of boundary conditions. The normalization of the bound
state requires the vanishing of the solution at infinity. This means that
$B_1 = A_3 = 0$. Furthermore, $\psi(x)$ should be continuously differentiable. All
particular solutions are fitted in such a way that $\psi$ as well as its first derivative $\psi'$ are
continuous at that value of $x$ corresponding to the border between the inside and
outside. The second derivative $\psi''$ contains the jump required by the particular
box-type potential of this Schrödinger equation. All this together leads to

$$\psi_1(-a) = \psi_2(-a) , \quad \psi_2(a) = \psi_3(a) ,$$
$$\psi_1'(-a) = \psi_2'(-a) , \quad \psi_2'(a) = \psi_3'(a) . \tag{4}$$


c) The eigenvalue equations. From (4) we obtain four linear and homogeneous
equations for the coefficients $A_1$, $A_2$, $B_2$, $B_3$, namely

$$A_1 \exp(-\kappa a) = A_2 \cos(ka) - B_2 \sin(ka) ,$$
$$\kappa A_1 \exp(-\kappa a) = A_2 k \sin(ka) + B_2 k \cos(ka) ,$$
$$B_3 \exp(-\kappa a) = A_2 \cos(ka) + B_2 \sin(ka) ,$$
$$-\kappa B_3 \exp(-\kappa a) = -A_2 k \sin(ka) + B_2 k \cos(ka) . \tag{5}$$

By addition and subtraction of these equations, we get a more lucid form of the
system of equations, which is easy to solve:

$$(A_1 + B_3) \exp(-\kappa a) = 2 A_2 \cos(ka) ,$$
$$\kappa (A_1 + B_3) \exp(-\kappa a) = 2 A_2 k \sin(ka) ,$$
$$(A_1 - B_3) \exp(-\kappa a) = -2 B_2 \sin(ka) ,$$
$$\kappa (A_1 - B_3) \exp(-\kappa a) = 2 B_2 k \cos(ka) . \tag{6}$$

Assuming that $A_1 + B_3 \neq 0$ and $A_2 \neq 0$, the first two equations yield

$$\kappa = k \tan(ka) . \tag{7}$$

Inserting this in one of the last two equations gives

$$A_1 = B_3 ; \quad B_2 = 0 . \tag{8}$$

Hence, as a result, we have a symmetric solution with $\psi(x) = \psi(-x)$. We then
speak of positive parity.
Almost identical calculations lead for $A_1 - B_3 \neq 0$ and for $B_2 \neq 0$ to

$$\kappa = -k \cot(ka) \quad \text{and} \quad A_1 = -B_3 ; \quad A_2 = 0 . \tag{9}$$

The thus-obtained wave function is an antisymmetric one, corresponding to
negative parity.

d) Qualitative solution of the eigenvalue problem. The equations connecting
$\kappa$ and $k$, which we have already obtained, are conditions for the energy
eigenvalue. Using the short forms

$$\xi = ka , \quad \eta = \kappa a , \tag{10}$$

we get from the definition (2)

$$\xi^2 + \eta^2 = \frac{2m V_0 a^2}{\hbar^2} . \tag{11}$$

On the other hand, using (7) and (9) we get the equations

$$\eta = \xi \tan(\xi) , \quad \eta = -\xi \cot(\xi) .$$

Therefore the desired energy values can be obtained by constructing the
intersection of those two curves with the circle defined by (11), within the $(\xi, \eta)$ plane
(see next figure).


The intersections of these curves determine the energy eigenvalues

At least one solution exists for arbitrary values of the parameter $V_0$, in the
case of positive parity, because the tan function intersects the origin. For
negative parity, the radius of the circle needs to be larger than a minimum value
so that the two curves can intersect. The potential must have a certain depth in
connection with a given size $a$ and a given mass $m$, to permit a solution with
negative parity. The number of energy levels increases with $V_0$, $a$ and mass $m$.
For the case $m V_0 a^2 \to \infty$, the intersections are found at

$$\tan(ka) = \infty \quad \text{corresponding to} \quad ka = \frac{2n - 1}{2}\, \pi ,$$
$$-\cot(ka) = \infty \quad \text{corresponding to} \quad ka = n\pi , \qquad n = 1, 2, 3, \ldots \tag{12}$$

or, combined:

$$k(2a) = n\pi . \tag{13}$$

For the energy spectrum this means that

$$E_n = \frac{\hbar^2}{2m} \left( \frac{n\pi}{2a} \right)^2 - V_0 . \tag{14}$$

On enlarging the potential well and/or the particle’s mass m, the difference 
between two neighbouring energy eigenvalues will decrease. The lowermost 
state (n = 1) is not located at —Vo, but a little higher. This difference is called 
the zero-point energy. We will come back to it later when discussing the 
harmonic oscillator (see Chap. 7). 

e) The shape of the wave function is shown for the discussed solutions in the 
following two figures. 
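The graphical construction of part d) can be replaced by a simple root search. In the sketch below the radius of the circle (11) is called `r`, i.e. $r^2 = 2mV_0a^2/\hbar^2$; this name, the bisection routine and the function names are our own. Each branch of $\eta = \xi\tan\xi$ (positive parity) and $\eta = -\xi\cot\xi$ (negative parity) lies in one interval of length $\pi/2$ and is intersected with the circle by bisection. The energy then follows from (2) as $E/V_0 = \xi^2/r^2 - 1$.

```python
import math

def bisect(func, lo, hi, iterations=200):
    """Plain bisection; assumes func changes sign between lo and hi."""
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if func(lo) * func(mid) <= 0.0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)

def bound_states(r):
    """Return (parity, xi) for every intersection with the circle xi^2 + eta^2 = r^2."""
    def even(xi):
        return xi * math.tan(xi) - math.sqrt(max(0.0, r * r - xi * xi))

    def odd(xi):
        return -xi / math.tan(xi) - math.sqrt(max(0.0, r * r - xi * xi))

    tiny = 1e-12
    roots = []
    j = 0
    while 0.5 * j * math.pi < r:
        # j-th branch: xi between j*pi/2 and (j+1)*pi/2, cut off at the circle radius
        lo = 0.5 * j * math.pi + tiny
        hi = min(r, 0.5 * (j + 1) * math.pi - tiny)
        if lo < hi:
            func, parity = (even, +1) if j % 2 == 0 else (odd, -1)
            roots.append((parity, bisect(func, lo, hi)))
        j += 1
    return roots

def energy_over_v0(xi, r):
    """E / V0 = xi^2 / r^2 - 1, which lies between -1 and 0 for bound states."""
    return xi * xi / (r * r) - 1.0

if __name__ == "__main__":
    for r in (1.0, 2.0, 4.0):
        print(r, [(p, round(energy_over_v0(xi, r), 4)) for p, xi in bound_states(r)])
```

In this sketch a shallow well ($r < \pi/2$) has a single state of positive parity, and every further multiple of $\pi/2$ in $r$ adds one level with alternating parity, in agreement with the qualitative discussion above.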


RR SSE 


6.3 The Delta Potential 


Suppose we have a potential of the form: 
Vib =— Vox) >; w>O0 ; xeR 


The corresponding wave function w(x) is supposed to be smooth. 


Problem. a) Search for the bound states (E < 0) which are localized at this 


potential. | . 
b) Calculate the scattering of an incoming plane wave at this potential and 


find the coefficient of reflection 


where Wrers Win are the the reflected and incoming waves, respectively. 


Hint: To evaluate the behaviour of (x) at x = 0, integrate the Schrodinger 
equation over the interval (—€, +e) and consider the limit ¢ > 0. 


Solution. a) The Schrödinger equation is given by

$$\left[-\frac{\hbar^2}{2m}\frac{d^2}{dx^2} - V_0\,\delta(x)\right]\psi(x) = E\,\psi(x) . \tag{1}$$

[Figure captions belonging to part e) of the preceding example: Wave functions with positive parity; they are symmetric relative to the origin. Wave functions with negative parity; they are antisymmetric relative to the origin.]

Away from the origin we have a differential equation of the form

$$\frac{d^2}{dx^2}\psi(x) = -\frac{2mE}{\hbar^2}\,\psi(x) . \tag{2}$$

The wave functions are therefore of the form

$$\psi(x) = A\,e^{-\beta x} + B\,e^{\beta x} \quad\text{if } x > 0 \text{ or } x < 0 \tag{3}$$

with $\beta = \sqrt{-2mE/\hbar^2} \in \mathbb{R}$. As $|\psi|^2$ must be integrable, there cannot be an exponentially increasing part. Furthermore the wave function should be continuous at the origin. Hence,

$$\psi(x) = A\,e^{-\beta x} \quad (x > 0) ,$$
$$\psi(x) = A\,e^{\beta x} \quad (x < 0) . \tag{4}$$

Integrating the Schrödinger equation from $-\varepsilon$ to $+\varepsilon$, we get

$$-\frac{\hbar^2}{2m}\left[\psi'(\varepsilon) - \psi'(-\varepsilon)\right] - V_0\,\psi(0) = E\int_{-\varepsilon}^{+\varepsilon}\psi(x)\,dx \approx 2\varepsilon E\,\psi(0) . \tag{5}$$

Inserting now result (4) and taking the limit $\varepsilon \to 0$, we have

$$-\frac{\hbar^2}{2m}(-\beta A - \beta A) - V_0 A = 0 \tag{6}$$

or $E = -mV_0^2/(2\hbar^2)$. Clearly (though surprisingly) there is only one energy eigenvalue. The normalization constant is found to be $A = \sqrt{mV_0/\hbar^2}$.
b) The wave function of a plane wave is described by

$$\psi(x) = A\,e^{ikx} , \qquad k^2 = \frac{2mE}{\hbar^2} . \tag{7}$$

It moves from the left-hand to the right-hand side and is reflected at the potential. If $B$ or $C$ is the amplitude of the reflected or transmitted wave, respectively, we get

$$\psi(x) = A\,e^{ikx} + B\,e^{-ikx} \quad (x < 0) ,$$
$$\psi(x) = C\,e^{ikx} \quad (x > 0) . \tag{8}$$

Conditions of continuity and the relation $\psi'(\varepsilon) - \psi'(-\varepsilon) = -f\,\psi(0)$ with $f = 2mV_0/\hbar^2$ give

$$A + B = C , \qquad B = -\frac{f}{f + 2ik}\,A , \qquad C = \frac{2ik}{f + 2ik}\,A . \tag{9}$$

The desired coefficient of reflection is therefore

$$R = \left.\frac{|\psi_{\rm ref}|^2}{|\psi_{\rm in}|^2}\right|_{x=0} = \frac{|B|^2}{|A|^2} = \frac{m^2 V_0^2}{m^2 V_0^2 + \hbar^4 k^2} . \tag{10}$$

If the potential is extremely strong ($V_0 \to \infty$), $R \to 1$, i.e. the whole wave is reflected.

The coefficient of transmission is, on the other hand,

$$T = \left.\frac{|\psi_{\rm trans}|^2}{|\psi_{\rm in}|^2}\right|_{x=0} = \frac{|C|^2}{|A|^2} = \frac{\hbar^4 k^2}{m^2 V_0^2 + \hbar^4 k^2} . \tag{11}$$

If the potential is very strong ($V_0 \to \infty$), $T \to 0$, i.e. the transmitted wave vanishes.


Obviously, R + T= 1 as is to be expected. 
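The amplitude ratios of (9) can be checked numerically. The following is a minimal sketch, assuming units with $\hbar = m = V_0 = 1$ and an arbitrarily chosen wave number; the function name `delta_well` and its parameter names are illustrative and not part of the text.

```python
def delta_well(m, V0, hbar, k):
    """Bound-state energy and scattering coefficients for V(x) = -V0 * delta(x).

    Uses f = 2 m V0 / hbar^2 and the amplitude ratios
    B/A = -f / (f + 2ik), C/A = 2ik / (f + 2ik).
    """
    f = 2.0 * m * V0 / hbar**2
    E_bound = -m * V0**2 / (2.0 * hbar**2)   # the single bound state
    B_over_A = -f / (f + 2j * k)             # reflected amplitude
    C_over_A = 2j * k / (f + 2j * k)         # transmitted amplitude
    R = abs(B_over_A) ** 2
    T = abs(C_over_A) ** 2
    return E_bound, R, T


# Illustrative values (assumed): hbar = m = V0 = 1, k = 0.5
E_bound, R, T = delta_well(m=1.0, V0=1.0, hbar=1.0, k=0.5)
R_closed_form = 1.0 / (1.0 + 0.5**2)         # m^2 V0^2 / (m^2 V0^2 + hbar^4 k^2)
print(E_bound, R, T, R + T)
```

For these values the sketch reproduces the closed form of (10), and the sum of both coefficients equals one up to rounding.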


EXERCISE


6.4 Distribution Functions in Quantum Statistics 
Let a quantum-mechanical system have the energy eigenvalues $\varepsilon_i$ which are degenerate $g_i$ times and are each occupied with $n_i$ particles in such a way that

$$\sum_i n_i = N \tag{1}$$

is the total number of indistinguishable particles, and

$$\sum_i n_i \varepsilon_i = E \tag{2}$$

is the total energy (for example, an ideal gas enclosed in a box).

Problem. a) A state can be occupied by one fermion only, but by an unlimited number of bosons. This is the consequence of the Pauli principle. (We will return to it later in Chap. 14.) Prove that it is possible to distribute $n_i$ particles over $g_i$ states with

$$\text{(I)} \quad W_{n_i}^{\rm FD} = \binom{g_i}{n_i} = \frac{g_i!}{n_i!\,(g_i - n_i)!} \qquad \text{(Fermi-Dirac statistics)}$$

possibilities in the case of fermions; with

$$\text{(II)} \quad W_{n_i}^{\rm BE} = \binom{g_i + n_i - 1}{n_i} = \frac{(g_i + n_i - 1)!}{n_i!\,(g_i - 1)!} \qquad \text{(Bose-Einstein statistics)}$$

possibilities in the case of bosons; and with

$$\text{(III)} \quad W_{n_i}^{\rm B} = g_i^{n_i} \qquad \text{(Boltzmann statistics)}$$

possibilities for classical, i.e. distinguishable, particles.

b) We define $\{n_i\} = \{n_1, n_2, \ldots\}$ as a distribution of particles with a "weight" $W\{n_i\}$, which is simply the number of possibilities of distributing exactly $n_i$ particles in energy levels $\varepsilon_i$. Naturally the one most likely to be realized, i.e. the most probable distribution, is the one with the greatest weight. Derive the variational principle from these remarks and by considering the constant number of particles and the constant total energy:

$$\delta\left[\ln\left(W\{\langle n_i\rangle\}\right) - \alpha N - \beta E\right] = 0 . \tag{3}$$

Here, the parameters $\alpha$, $\beta$ are Lagrange multipliers and $\{\langle n_i\rangle\}$ is the distribution searched for. Prove that in the case of $g_i > n_i \gg 1$ the average occupation number $\langle n_i\rangle$ for the level with index $i$ is given by

$$\langle n_i\rangle = g_i\left\{\exp\left[(\varepsilon_i - \mu)/kT\right] + \delta\right\}^{-1} \tag{4}$$

with

$$\delta = \begin{cases} +1 & \text{for fermions} \\ -1 & \text{for bosons} \\ \phantom{+}0 & \text{for classical, distinguishable particles.} \end{cases} \tag{5}$$

Hint: Use Stirling's formula for calculating $n!$ for large values of $n$:

$$n! \approx \sqrt{2\pi}\,n^{n + 1/2}\,e^{-n} \approx \left(\frac{n}{e}\right)^n , \tag{6}$$

and then insert $x_i \in \mathbb{R}$ for $n_i \in \mathbb{N}$, i.e. change from a discrete $n_i$ to a continuous variable $x_i$.

c) Draw a diagram for the Fermi-Dirac distribution $\langle n_i\rangle^{\rm FD}$ as a function of the energy $E$ at $T \approx 0$. How is the parameter $\mu$ to be interpreted?


Solution. a) In order to understand the different statistics, we first consider the 
problem of n; indistinguishable balls that are to be distributed into g; boxes. 

(I) Fermi-Dirac statistics. There are $n_i$ distinguishable balls to be distributed into $g_i$ boxes:

[Diagram: a row of boxes labelled $g_1, g_2, g_3, g_4, \ldots, g_i$, some of which contain one ball] (7)

Each box can contain only one ball (the Fermi-Dirac case!). The first one could be placed in one of the $g_i$ boxes. For the second one there are then $g_i - 1$ possibilities left, because one box is already occupied by the first ball. For the last ball, exactly $(g_i - n_i + 1)$ possibilities are found to exist if $g_i > n_i$. The total number of possibilities is given by the product

$$g_i (g_i - 1) \cdots (g_i - n_i + 1) = g_i (g_i - 1) \cdots (g_i - n_i + 1)\,\frac{(g_i - n_i)!}{(g_i - n_i)!} = \frac{g_i!}{(g_i - n_i)!} . \tag{8}$$


So far, we have assumed that the particles are distinguishable; if they are not distinguishable, however, there are several identical combinations. For example, two occupations of the boxes $g_1, g_2, g_3$ that differ only by an interchange of the balls are identical:

[Diagram (9): two such arrangements of balls in the boxes $g_1, g_2, g_3$]

This means that we have overestimated the number of possible distributions up to now by the number of permutations among the $n_i$ particles. The permutations of $n_i$ elements give a factor $n_i!$, so that finally

$$W_{n_i}^{\rm FD} = \frac{g_i!}{n_i!\,(g_i - n_i)!} = \binom{g_i}{n_i} . \tag{10}$$


(II) Bose-Einstein statistics. Each box is now able to contain arbitrarily many indistinguishable balls (the Bose-Einstein case!). Here the following method will work. There are

$$n + 1 = \binom{n + 1}{n}$$

possibilities of placing $n$ Bose particles in two levels. In three levels there are

$$\binom{n + 1}{n} + \binom{n}{n - 1} + \cdots + \binom{1}{0} = \binom{n + 2}{n}$$

possibilities. Generally it holds that

$$W_{n_i}^{\rm BE} = \binom{g_i + n_i - 1}{n_i} = \frac{(g_i + n_i - 1)!}{n_i!\,(g_i - 1)!} .$$


We want to reflect on this once more in a different way. For each particle and box, a slip of paper is marked with $K_1, \ldots, K_{g_i}$ for the boxes, and with $B_1, \ldots, B_{n_i}$ for the balls. Now slip $K_1$ is set aside and all other slips (both kinds, those labelled $K$ and those labelled $B$) are placed in an urn; there should be $g_i + n_i - 1$ left. Now the slips are taken out again, one by one and in an arbitrary sequence, and placed at the right of $K_1$; for example:

$$K_1\,B_8\,K_7\,B_3\,B_1\,K_3\,K_2\,B_4 \ldots \tag{11}$$

This can be interpreted as follows: balls located between two boxes are supposed to be in the left one. In our example, it would be ball number 8 that has to be in box number 1, whereas balls 3 and 1 belong to box number 7, no ball to box 3, ball 4 to box 2, and so on.

From part (I) we already know that there are $(g_i + n_i - 1)!$ arrangements possible (arrangements as shown in example (11)). On the other hand, the positions of the balls and their boxes in that row are unimportant. Indeed there are no new arrangements if the $n_i$ balls are exchanged among themselves. The same is valid if the $(g_i - 1)$ $K$ slips are exchanged. Thus there are

$$W_{n_i}^{\rm BE} = \frac{(g_i + n_i - 1)!}{n_i!\,(g_i - 1)!} = \binom{g_i + n_i - 1}{n_i} \tag{12}$$

different arrangements.

(III) Boltzmann statistics. Suppose we have two balls and $g_i$ boxes. There are exactly $g_i$ possibilities of distributing ball 1. If more than one ball is allowed in a single box, there are also $g_i$ possibilities of distributing ball 2. Altogether there are $g_i^2$ arrangements. Analogously, for $n_i$ balls there are

$$W_{n_i}^{\rm B} = g_i^{n_i} \tag{13}$$


arrangements. 
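The three counting rules of part a) can be verified by brute-force enumeration for small numbers. The sketch below is only an illustration with assumed values $g_i = 5$, $n_i = 3$; the helper names are hypothetical.

```python
from itertools import combinations, combinations_with_replacement, product
from math import comb


def count_fd(g, n):
    """Indistinguishable particles, at most one per state: subsets of n states."""
    return sum(1 for _ in combinations(range(g), n))


def count_be(g, n):
    """Indistinguishable particles, unlimited occupancy: multisets of n states."""
    return sum(1 for _ in combinations_with_replacement(range(g), n))


def count_boltzmann(g, n):
    """Distinguishable particles: each particle picks one of g states."""
    return sum(1 for _ in product(range(g), repeat=n))


g, n = 5, 3  # assumed example values
print(count_fd(g, n), comb(g, n))            # Fermi-Dirac: binomial(g, n)
print(count_be(g, n), comb(g + n - 1, n))    # Bose-Einstein: binomial(g + n - 1, n)
print(count_boltzmann(g, n), g**n)           # Boltzmann: g^n
```

Each enumerated count agrees with the corresponding closed formula, which is a useful sanity check before the formulas are used for large $g_i$ and $n_i$.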

b) As we have indistinguishable particles in both Fermi-Dirac statistics and Bose-Einstein statistics, the number of arrangements for distributing the particles over the energy levels $\varepsilon_i$, if we have the particle distribution $\{n_1, \ldots, n_m\}$, is given by the product

$$W^{\rm FD}\{n_1, \ldots, n_m\} = \prod_{i=1}^{m} W_{n_i}^{\rm FD} , \tag{14}$$
$$W^{\rm BE}\{n_1, \ldots, n_m\} = \prod_{i=1}^{m} W_{n_i}^{\rm BE} . \tag{15}$$

Let us turn now to Boltzmann statistics with distinguishable particles, where things are found to be more complicated. We are obliged to put $N = \sum_i n_i$ particles into the levels in such a way that there are $n_i$ particles in level $i$. The number of possible ways of doing this is calculated in the following way. We start with $N!$ possibilities, disregarding the group structure containing $m$ groups. We then correct this number by factors $n_i!$ coming from the arbitrary occupation in each group. This yields

$$\frac{N!}{n_1!\,n_2! \cdots n_m!} \tag{16}$$

possibilities. Hence,

$$W^{\rm B}\{n_1, n_2, \ldots, n_m\} = \frac{N!}{n_1!\,n_2! \cdots n_m!}\; g_1^{n_1} g_2^{n_2} \cdots g_m^{n_m} , \tag{17}$$

which we can also write as

$$W^{\rm B}\{n_i\} = N! \prod_{i=1}^{m} \frac{g_i^{n_i}}{n_i!} . \tag{18}$$


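The next step is to calculate the maximum of these various distributions $W\{n_i\}$, in order to find the particular distribution with the greatest weight. The maximum of $W^{\rm FD}\{n_i\}$ agrees with the maximum of $\ln(W^{\rm FD}\{n_i\})$, which is easier to handle mathematically. Therefore, using Stirling's formula (6), we obtain

$$\begin{aligned}
\ln W^{\rm FD}\{n_i\} &= \sum_i \ln\frac{g_i!}{n_i!\,(g_i - n_i)!} \\
&\approx \sum_i \ln\frac{(2\pi)^{1/2}\,g_i^{\,g_i + 1/2}\,e^{-g_i}}{(2\pi)^{1/2}\,n_i^{\,n_i + 1/2}\,e^{-n_i}\,(2\pi)^{1/2}\,(g_i - n_i)^{\,g_i - n_i + 1/2}\,e^{-(g_i - n_i)}} \\
&\approx \sum_i \left[g_i \ln g_i - n_i \ln n_i - (g_i - n_i)\ln(g_i - n_i)\right] .
\end{aligned} \tag{19}$$

In the last two steps we have made use of $g_i \gg 1$, $n_i \gg 1$. To find the maxima of a distribution we admit continuous values for $n_i$, so that $n_i \to x_i \in \mathbb{R}$, and introduce two Lagrange multipliers² $\beta$ and $\alpha$ to incorporate the conditions $E = \sum_i n_i \varepsilon_i$ and $N = \sum_i n_i$, respectively. The variational principle yields

$$\begin{aligned}
0 &= \delta\Big[\ln W^{\rm FD}\{x_j\} - \beta\sum_j \varepsilon_j x_j - \alpha\sum_j x_j\Big] \\
&= \sum_i \delta x_i\,\frac{\partial}{\partial x_i}\Big[\ln W^{\rm FD}\{x_j\} - \beta\sum_j \varepsilon_j x_j - \alpha\sum_j x_j\Big] \\
&= \sum_i \delta x_i\left(\ln\frac{g_i - x_i}{x_i} - \beta\varepsilon_i - \alpha\right) .
\end{aligned} \tag{20}$$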


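The last step is found as a result of (19), thus:

$$\frac{\partial}{\partial x_i}\ln W^{\rm FD}\{x_j\} = \sum_j \delta_{ij}\left[-\ln x_j - 1 + \ln(g_j - x_j) + 1\right] = \ln(g_i - x_i) - \ln x_i = \ln\frac{g_i - x_i}{x_i} . \tag{21}$$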


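A necessary condition for the existence of the maximum is that the terms in parentheses in (20) vanish; therefore we deduce that

$$e^{\beta\varepsilon_i + \alpha}\,x_i = g_i - x_i \tag{22}$$

and then

$$x_i = \frac{g_i}{e^{\beta\varepsilon_i + \alpha} + 1} . \tag{23}$$

Finally, it is convenient to rewrite the parameters $\beta$ and $\alpha$ in terms of the customary (and more physical) quantities $T$ and $\mu$, so that

$$\langle x_i\rangle_{\rm FD} = \langle n_i\rangle_{\rm FD} = g_i\left\{\exp\left[(\varepsilon_i - \mu)/kT\right] + 1\right\}^{-1}$$

with

$$\beta = \frac{1}{kT} , \qquad \alpha = -\frac{\mu}{kT} . \tag{24}$$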


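Coming back to the case of Bose-Einstein statistics, we proceed analogously. From

$$\begin{aligned}
\ln W^{\rm BE}\{n_i\} &= \sum_i \ln\frac{(g_i + n_i - 1)!}{n_i!\,(g_i - 1)!} \\
&\approx \sum_i \ln\frac{(2\pi)^{1/2}\,(g_i + n_i - 1)^{\,g_i + n_i - 1/2}\,e^{-(g_i + n_i - 1)}}{(2\pi)^{1/2}\,n_i^{\,n_i + 1/2}\,e^{-n_i}\,(2\pi)^{1/2}\,(g_i - 1)^{\,g_i - 1/2}\,e^{-(g_i - 1)}} \\
&\approx \sum_i \left\{\ln(2\pi)^{-1/2} + (g_i + n_i)\ln(g_i + n_i) - n_i \ln n_i - g_i \ln g_i\right\}
\end{aligned} \tag{25}$$

it follows that

$$\frac{\partial}{\partial x_i}\ln W^{\rm BE}\{x_j\} = \ln(g_i + x_i) + 1 - \ln x_i - 1 = \ln\frac{g_i + x_i}{x_i} \tag{26}$$

and therefore,

$$\delta\Big[\ln W^{\rm BE}\{x_j\} - \beta\sum_j \varepsilon_j x_j - \alpha\sum_j x_j\Big] = \sum_i \delta x_i\left(\ln\frac{g_i + x_i}{x_i} - \beta\varepsilon_i - \alpha\right) = 0 . \tag{27}$$

Thus we deduce that

$$\langle x_i\rangle_{\rm BE} = \langle n_i\rangle_{\rm BE} = g_i\left\{\exp\left[(\varepsilon_i - \mu)/kT\right] - 1\right\}^{-1} . \tag{28}$$

2 See H. Goldstein: Classical Mechanics, 2nd ed. (Addison-Wesley, Reading, MA 1980) or W. Greiner: Theoretische Physik, Vol. 2, Mechanik, 4th edition (Verlag Harri Deutsch, Thun and Frankfurt a.M. 1981).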


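In the case of Boltzmann statistics, we again proceed analogously and find

$$\begin{aligned}
\ln W^{\rm B}\{n_i\} &= \ln(N!) + \sum_i \ln\frac{g_i^{n_i}}{n_i!} \\
&= \ln N! + \sum_i \left(n_i \ln g_i - \ln n_i!\right) \\
&\approx \ln N! + \sum_i \left(n_i \ln g_i - \ln\left[\sqrt{2\pi}\,n_i^{\,n_i + 1/2}\,e^{-n_i}\right]\right) \\
&\approx \ln N! + \sum_i \left(n_i \ln g_i - \ln\sqrt{2\pi} - n_i \ln n_i + n_i\right) \\
&= \ln N! + \sum_i \left(n_i - \ln\sqrt{2\pi} + n_i \ln g_i - n_i \ln n_i\right) ,
\end{aligned} \tag{29}$$

so that

$$\frac{\partial}{\partial x_i}\ln W^{\rm B}\{x_j\} = 1 + \ln g_i - \ln x_i - 1 = \ln\frac{g_i}{x_i} . \tag{30}$$

Here, the variational principle yields

$$\delta\Big[\ln W^{\rm B}\{x_j\} - \beta\sum_j \varepsilon_j x_j - \alpha\sum_j x_j\Big] = \sum_i \delta x_i\left[\ln\frac{g_i}{x_i} - \beta\varepsilon_i - \alpha\right] = 0 , \tag{31}$$

and thus, finally,

$$\langle x_i\rangle_{\rm B} = \langle n_i\rangle_{\rm B} = g_i \exp\left[-(\varepsilon_i - \mu)/kT\right] . \tag{32}$$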


The Lagrange multipliers, which were introduced in (24), are fixed by the conditions

$$\sum_i \langle n_i\rangle = N , \qquad \sum_i \langle n_i\rangle\,\varepsilon_i = E . \tag{33}$$

Therefore

$$\alpha = \alpha(\mu, T) , \qquad \beta = \beta(T)$$

are functions of $\mu$ and $T$, respectively; the chemical potential $\mu = \mu(N, T)$ is a function of the particle number $N$ and of the temperature $T$. The interpretation of the parameter $T$ is suggested by comparison of e.g. the first equation in (33) in the case of Bose-Einstein statistics with Planck's radiation law [see Example 2.2, Eq. (13)]. Hence, it is plausible that $T$ will also be the temperature in the case of Fermi-Dirac statistics and Boltzmann statistics. For a stationary number of particles and fixed temperature $T$, $\mu$ is fixed, too. The quantity $k \ln(W\{\langle n_i\rangle\}) = S$ is called the entropy of the system, which coincides with the concept of entropy in thermodynamics. The entropy $S$ gives information about the system's disorder. The bigger $S$, the "more wildly" the particles are spread over the levels. From (23), (28) and (32) we finally get the form of the distribution functions corresponding to the various statistics. In compact form, $\langle n_i\rangle$ can be given as a function of temperature as

$$\langle n_i\rangle = g_i\left\{\exp\left[(\varepsilon_i - \mu)/kT\right] + \delta\right\}^{-1} , \tag{34}$$

using

$$\delta = \begin{cases} +1 & \text{in the case of fermions} \\ -1 & \text{in the case of bosons} \\ \phantom{+}0 & \text{in the case of classical particles.} \end{cases}$$
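The compact form (34) translates directly into a small function. The sketch below assumes energies measured in the same units as $kT$; the name `mean_occupation` and the sample values are illustrative only.

```python
import math


def mean_occupation(eps, mu, kT, g=1.0, delta=+1):
    """<n> = g / (exp((eps - mu)/kT) + delta).

    delta = +1: Fermi-Dirac, delta = -1: Bose-Einstein (needs eps > mu),
    delta = 0: Boltzmann.
    """
    return g / (math.exp((eps - mu) / kT) + delta)


# Assumed sample point two thermal energies above the chemical potential
eps, mu, kT = 1.2, 1.0, 0.1
n_fd = mean_occupation(eps, mu, kT, delta=+1)
n_b = mean_occupation(eps, mu, kT, delta=0)
n_be = mean_occupation(eps, mu, kT, delta=-1)
print(n_fd, n_b, n_be)

# Fermi-Dirac occupation at very low temperature approaches a step at eps = mu
print(mean_occupation(0.9, mu, 0.001), mean_occupation(1.1, mu, 0.001))
```

At the same energy the Bose-Einstein occupation is the largest and the Fermi-Dirac occupation the smallest; all three coincide when $\exp[(\varepsilon_i - \mu)/kT] \gg 1$.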
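c) In this special case we have from (23)

$$\langle n_E\rangle^{\rm FD} = \left\{\exp\left[(E - \mu)/kT\right] + 1\right\}^{-1} \tag{35}$$

and therefore

$$\lim_{T \to 0}\langle n_E\rangle^{\rm FD} = \begin{cases} 0 & \text{for } E > \mu \\ 1 & \text{for } E < \mu \end{cases} \;=\; \Theta(\mu - E) \tag{36}$$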


(solid line in the figure below). 
For T>0, corresponding to an energy kT, the Fermi level is “smeared” over 
the region AE ~ kT (see figure). 


EXERCISE


6.5 The Fermi Gas 


[Figure for Exercise 6.4: The Fermi-Dirac distribution as a function of the energy $E$]

From the foregoing we know that the allowed energy levels of noninteracting particles of mass $m$ in a three-dimensional, infinitely deep potential well with edges of length $l$ are given by (see Exercise 6.1)

$$E = \frac{\pi^2\hbar^2}{2ml^2}\left(n_x^2 + n_y^2 + n_z^2\right) ; \qquad n_x, n_y, n_z \in \mathbb{N} . \tag{1}$$


For electrons in a metal or for gas molecules in a container, the value of $l$ is supposed to be so large that the energy spectrum may be regarded as continuous.


Problem. a) Show that the number of states $\Delta N$ with energies in the interval between $E$ and $E + \Delta E$ is given by

$$\Delta N = \frac{1}{4\pi^2}\left(\frac{2m}{\hbar^2}\right)^{3/2} V\,E^{1/2}\,\Delta E . \tag{2}$$

b) Calculate the Fermi energy $E_F = \mu(T = 0)$ for an ideal Fermi gas of $N$ particles. Make use of the fact that each energy level is twice degenerate because of the spin degree of freedom.


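Solution. a) Instead of counting the states $(n_x, n_y, n_z)$ of equal energy (which would not be easy), we use the quasi-continuous approximation and determine the number of points inside a spherical shell of radius

$$n = \sqrt{n_x^2 + n_y^2 + n_z^2} \tag{3}$$

and thickness $\Delta n$ in the first octant in $(n_x, n_y, n_z)$-space, since $n_x, n_y, n_z \in \mathbb{N}$. This is simply the "volume"

$$\Delta N = \tfrac{1}{8}\,(4\pi n^2)\,\Delta n = \tfrac{\pi}{2}\,n^2\,\Delta n . \tag{4}$$

According to the assumption $n^2 = E\,(2ml^2/\pi^2\hbar^2)$, we have

$$n = \frac{l}{\pi\hbar}\sqrt{2mE} , \tag{5}$$
$$\Delta n = \frac{dn}{dE}\,\Delta E = \frac{\sqrt{2m}\,l}{2\pi\hbar}\,E^{-1/2}\,\Delta E \quad\text{and} \tag{6}$$
$$\Delta N = \frac{(2m)^{3/2}\,V}{4\pi^2\hbar^3}\,E^{1/2}\,\Delta E = C\,E^{1/2}\,\Delta E , \qquad C = \frac{(2m)^{3/2}\,V}{4\pi^2\hbar^3} , \quad V = l^3 . \tag{7}$$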


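b) The energy of the discrete case is given by

$$E_T = \sum_i \langle n_i\rangle_T\,\varepsilon_i . \tag{8}$$

If the energies are continuous, this expression changes into

$$E_T = \sum_{i=0}^{\infty}\frac{\langle n_i\rangle}{\Delta E_i}\,E_i\,\Delta E_i = \sum_{i=0}^{\infty} g(E_i)\,f(E_i)\,E_i\,\Delta E_i \;\to\; \int_0^\infty g(E)\,f(E)\,E\,dE . \tag{9}$$

Here, $g(E)\,\Delta E$ is the number of states in the energy interval $\Delta E$; $f(E)$ describes the fraction of occupied states. In fact, we are able to interpret the energy interval $(E, E + \Delta E)$ in the continuous case (for sufficiently small $\Delta E$) as one level with degeneracy $g_i = g(E)\,\Delta E = s\,\Delta N$ ($s = 1$ for bosons, $s = 2$ for fermions of spin $\tfrac{1}{2}$). The expression $f(E) = \langle n_E\rangle / [g(E)\,\Delta E]$ is called the distribution function and $g(E)$ the density of states.

Because of spin degeneracy ($s = 2$), we have from (7) for fermions:

$$g(E) = 2C\,E^{1/2} , \qquad f^{\rm FD}(E) = \left\{\exp\left[(E - \mu)/kT\right] + 1\right\}^{-1} . \tag{10}$$

For $T = 0$, $f^{\rm FD}_{T=0}(E) = \Theta(E_F - E)$ [see Exercise 6.4; $\Theta$ is the Heaviside step function] and the number of particles $N$ is assumed to be constant, i.e.

$$\begin{aligned}
N &= \sum_{i=0}^{\infty}\langle n_i\rangle \;\to\; \int_0^\infty f^{\rm FD}_{T=0}(E)\,g(E)\,dE \\
&= \int_0^\infty \Theta(E_F - E)\,2C\,E^{1/2}\,dE = 2C\int_0^{E_F} E^{1/2}\,dE = \tfrac{4}{3}\,C\,E_F^{3/2} , \\
E_F &= \left(\frac{3N}{4C}\right)^{2/3} = \frac{\hbar^2}{2m}\left(3\pi^2\,\frac{N}{V}\right)^{2/3} .
\end{aligned} \tag{11}$$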


Application: an electron gas in a metal can be considered a Fermi gas. The mean energy at temperature $T = 0$ is then

$$\bar{E} = \frac{\int_0^{E_F} E \cdot E^{1/2}\,dE}{\int_0^{E_F} E^{1/2}\,dE} = \frac{3}{5}\,E_F . \tag{12}$$

From this quantity we can calculate the pressure $p$ of the electron gas (zero-point pressure). It is possible to define the pressure through the work $dA$ that has to be done to reduce the volume by $dV$: $dA = p\,|dV|$. This work is equivalent to the increase of the contained energy, which is $N\bar{E}$. Then, without changing the total number of electrons during compression, we simply obtain from (12) and (11):

$$p = N\left|\frac{d\bar{E}}{dV}\right| = \frac{2}{5}\,\frac{N}{V}\,E_F . \tag{13}$$
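As a rough numerical illustration, the Fermi energy $E_F = (\hbar^2/2m)(3\pi^2 N/V)^{2/3}$ and the zero-point pressure $p = \tfrac{2}{5}(N/V)E_F$ can be evaluated for a metal. The sketch below assumes a conduction-electron density of about $5.86 \times 10^{28}\,{\rm m^{-3}}$ for silver (one electron per atom); this density is an assumption brought in for the example and is not given in the text.

```python
import math

HBAR = 1.054571817e-34   # reduced Planck constant, J s
M_E = 9.1093837015e-31   # electron mass, kg
EV = 1.602176634e-19     # one electron volt in J
ATM = 101325.0           # one standard atmosphere in Pa


def fermi_energy(n):
    """Fermi energy in J for electron number density n (in m^-3), spin 1/2."""
    return HBAR**2 / (2.0 * M_E) * (3.0 * math.pi**2 * n) ** (2.0 / 3.0)


def zero_point_pressure(n):
    """Zero-point pressure p = (2/5) n E_F in Pa."""
    return 0.4 * n * fermi_energy(n)


n_silver = 5.86e28       # assumed conduction-electron density of silver, m^-3
E_F_eV = fermi_energy(n_silver) / EV
p_atm = zero_point_pressure(n_silver) / ATM
print(f"E_F = {E_F_eV:.2f} eV, p = {p_atm:.2e} atm")
```

With this assumed density the Fermi energy comes out at a few electron volts and the pressure at the order of $10^5$ atmospheres.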


Taking silver as an example, we get a zero-point pressure of approximately $2 \times 10^5$ times the atmospheric pressure. This enormous pressure just means that the compression of the electron gas requires a large amount of work. The result is a typical quantum-mechanical effect: if all electrons were in the lowest state, then only the much smaller amount of work involved in raising the zero-level energy of each electron would be required, approaching zero with increasing volume. The Pauli principle causes occupation of higher states, the mutual distance of which increases with a decrease in the volume as a consequence of the uncertainty principle. Therefore the Pauli principle is essential to the explanation of the limited compressibility of solids.

[Figure for Exercise 6.7: The schematic two-centred potential]

EXERCISE


6.6 An Ideal Classical Gas 


Problem. Show that for the ideal classical gas

$$E = \tfrac{3}{2}\,NkT$$

is valid, where $E$ is the total energy and $N$ the total number of particles. What is the interpretation of the temperature $T$ in this case?

Hint: Use $\int_0^\infty x^{t-1} e^{-x}\,dx = \Gamma(t)$ together with $\Gamma(t + 1) = t\,\Gamma(t)$ and $\Gamma(\tfrac{1}{2}) = \sqrt{\pi}$.


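Solution. For the ideal Boltzmann gas we have [see Exercise 6.4, Eq. (34)]:

$$f^{\rm B}(E) = e^{-(E - \mu)/kT} \quad\text{and}\quad g(E) = C\,E^{1/2} . \tag{1}$$

Then

$$\begin{aligned}
E &= \int_0^\infty f^{\rm B}(E)\,g(E)\,E\,dE = C\,e^{\mu/kT}\int_0^\infty E^{3/2}\,e^{-E/kT}\,dE \\
&= C\,e^{\mu/kT}\,(kT)^{5/2}\int_0^\infty x^{3/2}\,e^{-x}\,dx \\
&= \Gamma(\tfrac{5}{2})\,C\,(kT)^{5/2}\,e^{\mu/kT} \quad\text{and}
\end{aligned} \tag{2}$$

$$\begin{aligned}
N &= \int_0^\infty f^{\rm B}(E)\,g(E)\,dE = C\,e^{\mu/kT}\,(kT)^{3/2}\int_0^\infty x^{1/2}\,e^{-x}\,dx \\
&= \Gamma(\tfrac{3}{2})\,C\,(kT)^{3/2}\,e^{\mu/kT} .
\end{aligned} \tag{3}$$

It follows from (2) and (3) that

$$E = \frac{\Gamma(\tfrac{5}{2})}{\Gamma(\tfrac{3}{2})}\,NkT = \tfrac{3}{2}\,NkT . \tag{4}$$

The mean energy of a particle is $E/N = \tfrac{3}{2}kT$. This result elucidates the meaning of the parameter $T$, at least in the case of the classical ideal gas: $T$ is directly proportional to the energy per particle.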
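The ratio of the two Gamma functions that fixes the factor $\tfrac{3}{2}$ can be checked both with the library function and by direct quadrature of the integrals from the hint. This is a minimal sketch; the helper `gamma_integral`, its cutoff and its step count are choices made for the illustration.

```python
import math


def gamma_integral(t, upper=60.0, steps=60000):
    """Midpoint-rule estimate of the integral of x^(t-1) e^(-x) from 0 to `upper`."""
    h = upper / steps
    total = 0.0
    for i in range(steps):
        x = (i + 0.5) * h
        total += x ** (t - 1.0) * math.exp(-x)
    return total * h


# Energy per particle in units of kT: Gamma(5/2) / Gamma(3/2)
ratio_exact = math.gamma(2.5) / math.gamma(1.5)
ratio_numeric = gamma_integral(2.5) / gamma_integral(1.5)
print(ratio_exact, ratio_numeric)
```

Both routes give the same ratio within the accuracy of the quadrature, in agreement with $\Gamma(t + 1) = t\,\Gamma(t)$ for $t = \tfrac{3}{2}$.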


EXERCISE


6.7 A Particle in a Two-Centred Potential 


We consider the following simple model of a diatomic (one-dimensional) molecule, consisting of two potential wells lying side by side (see figure); here, the potential is given by ($a < l$)

$$V(x) = \begin{cases} \infty & \text{for } |x| > (l + a)/2 \\ 0 & \text{for } (l - a)/2 < |x| < (l + a)/2 \\ V_0 & \text{for } |x| < (l - a)/2 . \end{cases}$$

Particles of mass $m$ described by the Schrödinger equation move in this potential. We wish to find the eigenvalues ($0 < E < V_0$) of the Hamiltonian operator and the corresponding wave functions. We shall then compare this result with the result for two atoms at large distance ($l \gg a$) and discuss why molecular binding is established.

We write down the Schrödinger equation,

$$E\,\psi(x) = \left[\frac{\hat{p}^2}{2m} + V(x)\right]\psi(x) ,$$

for the regions II, III, IV.


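[Figure axis: the region boundaries lie at $x = -\tfrac{1}{2}(l + a)$, $-\tfrac{1}{2}(l - a)$, $0$, $\tfrac{1}{2}(l - a)$, $\tfrac{1}{2}(l + a)$]

$$\text{II, IV:} \quad -\frac{\hbar^2}{2m}\frac{d^2}{dx^2}\psi(x) = E\,\psi(x) ,$$
$$\text{III:} \quad \left(-\frac{\hbar^2}{2m}\frac{d^2}{dx^2} + V_0\right)\psi(x) = E\,\psi(x) . \tag{1}$$

In regions I and V, the wave function has to vanish, $\psi_{\rm I} = \psi_{\rm V} = 0$. These equations give

$$\begin{aligned}
&\text{I)} & \psi_{\rm I} &= 0 , \\
&\text{II)} & \psi_{\rm II} &= A' \sin kx + A'' \cos kx , & -\tfrac{1}{2}(l + a) < x < -\tfrac{1}{2}(l - a) , \\
&\text{III)} & \psi_{\rm III} &= B\,e^{\beta x} + C\,e^{-\beta x} , & -\tfrac{1}{2}(l - a) < x < +\tfrac{1}{2}(l - a) , \\
&\text{IV)} & \psi_{\rm IV} &= D' \sin kx + D'' \cos kx , & +\tfrac{1}{2}(l - a) < x < +\tfrac{1}{2}(l + a) , \\
&\text{V)} & \psi_{\rm V} &= 0
\end{aligned} \tag{2}$$

with

$$k = \frac{1}{\hbar}\sqrt{2mE} , \qquad \beta = \frac{1}{\hbar}\sqrt{2m(V_0 - E)} . \tag{3}$$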


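The wave function has to be continuous; in particular, we must have $\psi_{\rm II}(-\tfrac{1}{2}(l - a)) = \psi_{\rm III}(-\tfrac{1}{2}(l - a))$, etc. Furthermore, the derivative has to be continuous for $|x| = \tfrac{1}{2}(l - a)$. Therefore we have to "join" the different solutions in an appropriate way, which will only be possible for certain energies $E$ (i.e. certain $k$, $\beta$). The boundary condition at the points $x = \pm\tfrac{1}{2}(l + a)$ immediately implies

$$\begin{aligned}
&\text{II)} & \psi_{\rm II}(x) &= A \sin k\left[x + \tfrac{1}{2}(l + a)\right] , \\
&\text{IV)} & \psi_{\rm IV}(x) &= D \sin k\left[x - \tfrac{1}{2}(l + a)\right] .
\end{aligned} \tag{4}$$

[Figure: The different regions of the potential]

Now we still have four boundary conditions:

$$\begin{aligned}
B\,e^{-\frac{\beta}{2}(l - a)} + C\,e^{+\frac{\beta}{2}(l - a)} &= A \sin ka , \\
B\beta\,e^{-\frac{\beta}{2}(l - a)} - C\beta\,e^{+\frac{\beta}{2}(l - a)} &= A k \cos ka , \\
B\,e^{+\frac{\beta}{2}(l - a)} + C\,e^{-\frac{\beta}{2}(l - a)} &= -D \sin ka , \\
B\beta\,e^{+\frac{\beta}{2}(l - a)} - C\beta\,e^{-\frac{\beta}{2}(l - a)} &= D k \cos ka
\end{aligned}$$

or

$$\begin{aligned}
2(B + C)\cosh\tfrac{\beta}{2}(l - a) &= (A - D)\sin ka , \\
2(B - C)\,\beta\cosh\tfrac{\beta}{2}(l - a) &= (A + D)\,k\cos ka , \\
-2(B - C)\sinh\tfrac{\beta}{2}(l - a) &= (A + D)\sin ka , \\
-2(B + C)\,\beta\sinh\tfrac{\beta}{2}(l - a) &= (A - D)\,k\cos ka .
\end{aligned} \tag{5}$$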


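These are four (partially decoupled) equations for the variables $B + C$, $B - C$, $A + D$ and $A - D$, which are not allowed to vanish all together. Now $\tfrac{1}{2}\beta(l - a)$ is always $\neq 0$; therefore we can divide by the hyperbolic functions,

$$\begin{aligned}
B + C &= \frac{\sin ka}{2\cosh\frac{\beta}{2}(l - a)}\,(A - D) = -\frac{k\cos ka}{2\beta\sinh\frac{\beta}{2}(l - a)}\,(A - D) , \\
B - C &= -\frac{\sin ka}{2\sinh\frac{\beta}{2}(l - a)}\,(A + D) = \frac{k\cos ka}{2\beta\cosh\frac{\beta}{2}(l - a)}\,(A + D) .
\end{aligned} \tag{6}$$

From this it follows that either

$$\underbrace{A - D = B + C = 0}_{A_1} \quad\text{or}\quad \underbrace{\frac{1}{k}\tan ka = -\frac{1}{\beta}\coth\tfrac{\beta}{2}(l - a) \;\text{ and }\; B + C = \frac{\sin ka}{2\cosh\frac{\beta}{2}(l - a)}\,(A - D)}_{A_2} \tag{7}$$

and either

$$\underbrace{A + D = B - C = 0}_{A_3} \quad\text{or}\quad \underbrace{\frac{1}{k}\tan ka = -\frac{1}{\beta}\tanh\tfrac{\beta}{2}(l - a) \;\text{ and }\; B - C = -\frac{\sin ka}{2\sinh\frac{\beta}{2}(l - a)}\,(A + D)}_{A_4} . \tag{8}$$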


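Each of the conditions $A_1$ to $A_4$ is either true or false. Therefore we abbreviate the above statements using logic symbols:

$$(A_1 \vee A_2) \wedge (A_3 \vee A_4) = \text{true} ;$$

$\vee$ means "or" and $\wedge$ means "and". Expanding gives

$$(A_1 \wedge A_3) \vee (A_1 \wedge A_4) \vee (A_2 \wedge A_3) \vee (A_2 \wedge A_4) = \text{true} .$$

First of all, we show that the conditions $(A_1 \wedge A_3)$ and $(A_2 \wedge A_4)$ cannot be true:

$$(A_1 \wedge A_3) = \text{true} \;\Rightarrow\; A = B = C = D = 0 . \tag{9}$$

Since the variables are not allowed to vanish all together, we have

$$(A_1 \wedge A_3) = \text{false} ,$$
$$(A_2 \wedge A_4) = \text{true} \;\Rightarrow\; \tanh\tfrac{\beta}{2}(l - a) = \coth\tfrac{\beta}{2}(l - a) . \tag{10}$$

The last equation cannot be satisfied for any finite value of $\beta(l - a)$, since $\tanh y < 1 < \coth y$ for every finite $y > 0$. Hence, $(A_2 \wedge A_4) = \text{false}$, too. Our logical equation is therefore reduced to

$$(A_1 \wedge A_4) \vee (A_2 \wedge A_3) = \text{true} .$$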


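Inserting the definitions gives the following system of equations:

$$A_1 - D_1 = B_1 + C_1 = 0 , \qquad \frac{1}{k_1}\tan k_1 a = -\frac{1}{\beta_1}\tanh\tfrac{\beta_1}{2}(l - a) , \qquad B_1 = \frac{-\sin k_1 a}{2\sinh\frac{\beta_1}{2}(l - a)}\,A_1 ,$$

or

$$A_2 + D_2 = B_2 - C_2 = 0 , \qquad \frac{1}{k_2}\tan k_2 a = -\frac{1}{\beta_2}\coth\tfrac{\beta_2}{2}(l - a) , \qquad B_2 = \frac{\sin k_2 a}{2\cosh\frac{\beta_2}{2}(l - a)}\,A_2 . \tag{11}$$

The two middle equations yield the eigenvalues $E_1^{(i)}$ and $E_2^{(i)}$, respectively, as the solution of a complicated equation

$$\left\{k_j = \frac{(2mE_j)^{1/2}}{\hbar} , \quad \beta_j = \frac{[2m(V_0 - E_j)]^{1/2}}{\hbar}\right\} . \tag{12}$$

Now we drop the upper index, which numbers the different solutions. The wave functions in the regions II, III, IV are consequently

$$\begin{aligned}
&\text{II)} & \psi_1 &= A_1 \sin k_1\left[x + \tfrac{1}{2}(l + a)\right] , \\
& & \psi_2 &= A_2 \sin k_2\left[x + \tfrac{1}{2}(l + a)\right] , \\
&\text{III)} & \psi_1 &= 2B_1 \sinh\beta_1 x , \\
& & \psi_2 &= 2B_2 \cosh\beta_2 x , \\
&\text{IV)} & \psi_1 &= A_1 \sin k_1\left[x - \tfrac{1}{2}(l + a)\right] , \\
& & \psi_2 &= -A_2 \sin k_2\left[x - \tfrac{1}{2}(l + a)\right] .
\end{aligned} \tag{13}$$

Strictly speaking, we should now normalize the wave functions; this, however, we will not do. Since they belong to different eigenvalues, they are certainly orthogonal. The normalization determines the $A_i$ and therefore also the $B_i$ up to a phase.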

We note that $\psi_1$ and $\psi_2$ are also eigenfunctions of the parity operator: $\psi_1$ possesses negative parity, $\psi_2$ positive parity.


In the case of large distances ($l \to \infty$), i.e. two distinct atoms, both equations determining the energy become identical,

$$\frac{1}{k}\tan ka = -\frac{1}{\beta} \quad\text{or}\quad \tan\left(\frac{a}{\hbar}\sqrt{2mE}\right) = -\sqrt{\frac{E}{V_0 - E}} . \tag{14}$$

This equation is not analytically solvable, either. [If we analyse the extreme case $V_0 \to \infty$, we get two atoms which are separated by a very high potential wall and are therefore also independent. Here we have $\tan\left((a/\hbar)\sqrt{2mE}\right) = 0$ (see Exercise 6.1).]

Now we want to derive an approximate relation for the energy differences $E_1 - E = \Delta_1$ and $E_2 - E = \Delta_2$. $E$ is the energy for two completely separated atoms ($l \to \infty$, $V_0 < \infty$); $E_i$ ($i = 1, 2$) is the energy for two atoms at a finite but large distance $l$. We assume $\beta_i(l - a) \gg 1$, i.e. we examine the lower energy states. To abbreviate, we define $b = \tfrac{1}{2}(l - a)$; therefore $\beta b \gg 1$. $k$ and $\beta$ are assumed to have the same meaning as above ($l \to \infty$).

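With the aid of Taylor's formula, it follows that

$$k_i \approx k\left(1 + \frac{\Delta_i}{2E}\right) , \qquad \beta_i \approx \beta\left(1 - \frac{\Delta_i}{2(V_0 - E)}\right) \tag{15}$$

if we drop expressions containing $\Delta_i^2$, $\Delta_i e^{-2\beta b}$ and $e^{-4\beta b}$. With this we get, using $\tan ka = -k/\beta$,

$$\begin{aligned}
\tan k_i a &= \tan\left(ka + ka\,\frac{\Delta_i}{2E}\right) \approx \tan ka + (1 + \tan^2 ka)\,ka\,\frac{\Delta_i}{2E} \\
&= -\frac{k}{\beta} + \frac{ka V_0\,\Delta_i}{2E(V_0 - E)} ,
\end{aligned} \tag{16}$$

$$\begin{aligned}
\tanh\beta_i b &= \tanh\left[\beta b\left(1 - \frac{\Delta_i}{2(V_0 - E)}\right)\right] \approx \tanh\beta b - \frac{\beta b\,\Delta_i}{2(V_0 - E)}\,\frac{1}{\cosh^2\beta b} \\
&\approx \tanh\beta b = 1 + \frac{\sinh\beta b - \cosh\beta b}{\cosh\beta b} \approx 1 - 2e^{-2\beta b} ,
\end{aligned} \tag{17}$$

$$\coth\beta_i b \approx \coth\beta b \approx 1 + 2e^{-2\beta b} .$$

Consequently

$$\begin{aligned}
\frac{1}{k_i}\tan k_i a &= \frac{1}{k}\left(1 - \frac{\Delta_i}{2E}\right)\left(-\frac{k}{\beta} + \frac{ka V_0\,\Delta_i}{2E(V_0 - E)}\right) \\
&\approx -\frac{1}{\beta} + \frac{\Delta_i}{2\beta E} + \frac{a V_0\,\Delta_i}{2E(V_0 - E)} = -\frac{1}{\beta} + \frac{V_0(1 + a\beta) - E}{2\beta E(V_0 - E)}\,\Delta_i , \\
-\frac{1}{\beta_1}\tanh\beta_1 b &\approx -\frac{1}{\beta}\left(1 + \frac{\Delta_1}{2(V_0 - E)}\right)\left(1 - 2e^{-2\beta b}\right) \approx -\frac{1}{\beta} + \frac{2}{\beta}\,e^{-2\beta b} - \frac{\Delta_1}{2\beta(V_0 - E)} , \\
-\frac{1}{\beta_2}\coth\beta_2 b &\approx -\frac{1}{\beta} - \frac{2}{\beta}\,e^{-2\beta b} - \frac{\Delta_2}{2\beta(V_0 - E)} .
\end{aligned} \tag{18}$$

[Figure caption: $l \to \infty$: two separated potential wells with energy levels $E^{(1)}, E^{(2)}, \ldots$; $l$ large, but finite: the formerly degenerate energy levels split up; one level is lowered, the other one is elevated in energy]

Hence, the eigenvalue equation for $E_1$ reads

$$-\frac{1}{\beta} + \frac{V_0(1 + a\beta) - E}{2\beta E(V_0 - E)}\,\Delta_1 = -\frac{1}{\beta} + \frac{2}{\beta}\,e^{-2\beta b} - \frac{\Delta_1}{2\beta(V_0 - E)} \tag{19}$$

with the solution

$$\Delta_1 = \frac{4E(V_0 - E)}{V_0(1 + a\beta)}\,e^{-2\beta b} . \tag{20}$$

Correspondingly, $\Delta_2 = -\Delta_1$. The energy $E_1$ exceeds the energy of a single atom; the energy $E_2$ is, however, smaller. Hence, we get the following picture:

[Figure: energy-level diagram; each level $E^{(1)}, E^{(2)}, \ldots$ of the separated wells splits into two neighbouring levels at finite $l$]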


If we bring together two potential wells (atoms) with an electron in the state 
of lowest energy, we will decrease the energy, i.e. we get a bound state for the 
molecule. 
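The level splitting can also be examined numerically by solving the transcendental eigenvalue conditions directly. The sketch below assumes units with $\hbar = m = 1$ and arbitrarily chosen values $V_0 = 10$, $a = 1$, $l = 3$; it rewrites each condition $(1/k)\tan ka = -t/\beta$ as $\beta\sin ka + t\,k\cos ka = 0$ with $t = 1$ (separated wells), $t = \tanh\beta b$ (state $\psi_1$) or $t = \coth\beta b$ (state $\psi_2$), and compares the exact shifts with the approximate result $\Delta_1 = 4E(V_0 - E)e^{-2\beta b}/[V_0(1 + a\beta)]$ that follows from (19). The function name and the bisection bracket are choices made for this illustration.

```python
import math


def eigen_energy(V0, a, b, kind, hbar=1.0, m=1.0, iters=200):
    """Lowest root of beta*sin(ka) + t*k*cos(ka) = 0, found by bisection.

    kind = "isolated": t = 1              (l -> infinity)
    kind = "odd":      t = tanh(beta*b)   (negative parity, psi_1)
    kind = "even":     t = coth(beta*b)   (positive parity, psi_2)
    """
    def f(E):
        k = math.sqrt(2.0 * m * E) / hbar
        beta = math.sqrt(2.0 * m * (V0 - E)) / hbar
        if kind == "isolated":
            t = 1.0
        elif kind == "odd":
            t = math.tanh(beta * b)
        else:
            t = 1.0 / math.tanh(beta * b)
        return beta * math.sin(k * a) + t * k * math.cos(k * a)

    # The lowest root has pi/2 < k*a < pi; f is positive at the lower end
    # of this bracket and negative at the upper end.
    lo = (hbar * math.pi / (2.0 * a)) ** 2 / (2.0 * m)
    hi = (hbar * math.pi / a) ** 2 / (2.0 * m)
    assert hi < V0, "bracket must stay below the barrier height"
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if f(lo) * f(mid) <= 0.0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)


V0, a, l = 10.0, 1.0, 3.0          # assumed illustrative parameters
b = 0.5 * (l - a)
E = eigen_energy(V0, a, b, "isolated")
E1 = eigen_energy(V0, a, b, "odd")
E2 = eigen_energy(V0, a, b, "even")
beta = math.sqrt(2.0 * (V0 - E))   # hbar = m = 1
delta_approx = 4.0 * E * (V0 - E) * math.exp(-2.0 * beta * b) / (V0 * (1.0 + a * beta))
print(E, E1 - E, E2 - E, delta_approx)
```

For these parameters the antisymmetric level lies slightly above and the symmetric level slightly below the level of the separated wells, and both shifts are close to the approximate expression because $\beta b \gg 1$.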

Our model is, of course, somewhat unrealistic. For one thing, we should question how well atoms can be described by one-dimensional rectangular potential wells. Furthermore, we have to take into account that the two atomic nuclei are repulsive, so that binding is hampered. The two atoms will adjust themselves to a distance at which the force of attraction caused by the change of energy of the electron is as large as the repulsion of the two nuclei. In this state, the total energy of the system (2 atoms + 1 e$^-$) is minimal.


6.1 The Conservation of Particle Number 
in Quantum Mechanics 


In electrodynamics, the well-known continuity equation is valid:

$$\frac{\partial\varrho_e}{\partial t} + {\rm div}\,\boldsymbol{j}_e = 0 , \tag{6.14}$$

where $\varrho_e$ is the charge density and $\boldsymbol{j}_e$ the current density. This equation is the law of conservation for the electric charge: if the charge density in a volume element changes, then a current flows through the surface of the volume element (Gauss's law).




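Now we try to find a similar relation for the number of particles in a region. Instead of the charge density, we consider the probability density $w = \psi^*\psi$. If we demand that no particles be created or annihilated, we also have a continuity equation:

$$\frac{\partial w}{\partial t} + {\rm div}\,\boldsymbol{j} = 0 . \tag{6.15}$$

Our aim is to deduce the particle current density $\boldsymbol{j}$. To this end, we begin with the time-dependent Schrödinger equation

$$\frac{\partial\psi}{\partial t} = \frac{1}{i\hbar}\,\hat{H}\psi . \tag{6.16}$$

The complex-conjugated equation is

$$\frac{\partial\psi^*}{\partial t} = -\frac{1}{i\hbar}\,\hat{H}^*\psi^* . \tag{6.17}$$

Multiplying the first equation by $\psi^*$ from the left and the second by $\psi$ and adding both, we get

$$\frac{\partial}{\partial t}(\psi^*\psi) + \frac{i}{\hbar}\left(\psi^*\hat{H}\psi - \psi\hat{H}^*\psi^*\right) = 0 . \tag{6.18}$$

If we assume that the potential is independent of velocity and real, we are able to insert $\hat{H} = \hat{\boldsymbol{p}}^2/2m + V(\boldsymbol{r})$ and obtain

$$\frac{\partial}{\partial t}(\psi^*\psi) + \frac{i\hbar}{2m}\left(\psi\nabla^2\psi^* - \psi^*\nabla^2\psi\right) = 0 . \tag{6.19}$$

From the second expression in brackets we can extract a nabla or del operator:

$$\begin{aligned}
\psi\nabla^2\psi^* - \psi^*\nabla^2\psi &= \psi\nabla^2\psi^* + \nabla\psi\cdot\nabla\psi^* - \nabla\psi^*\cdot\nabla\psi - \psi^*\nabla^2\psi \\
&= \nabla\cdot\left(\psi\nabla\psi^* - \psi^*\nabla\psi\right) .
\end{aligned}$$

With this we have

$$\frac{\partial}{\partial t}(\psi^*\psi) + \frac{i\hbar}{2m}\,{\rm div}\left(\psi\nabla\psi^* - \psi^*\nabla\psi\right) = 0 . \tag{6.20}$$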


This equation is of the form of the desired continuity equation, if we define the particle current density in the following way:

$$\boldsymbol{j} = \frac{i\hbar}{2m}\left(\psi\nabla\psi^* - \psi^*\nabla\psi\right) . \tag{6.21}$$

Application of Gauss's law

$$\int_V ({\rm div}\,\boldsymbol{j})\,dV = \oint_F \boldsymbol{j}\cdot\boldsymbol{n}\,dF \tag{6.22}$$

leads to the integrated equation:

$$\frac{\partial}{\partial t}\int_V \psi^*\psi\,dV + \oint_F \boldsymbol{j}\cdot\boldsymbol{n}\,dF = 0 . \tag{6.23}$$

The particle flux through the surface of a region is equivalent to the variation of the particle density inside the region.

We had required the time-independent normalization of the wave function

$$\int \psi^*\psi\,dV = 1 ,$$

i.e. that the particle current through an infinitely distant surface vanish. Hence, by inspection of (6.23), only those states can be normalized to 1 whose current flux through an infinitely distant surface vanishes. If we want to calculate the mass current density or the electric current from the particle current density, we have to multiply the continuity equation by the mass (or charge), since the mass (or charge) density is given by

$$\varrho_m = m\,\psi^*\psi \qquad (\varrho_e = e\,\psi^*\psi) . \tag{6.24}$$

Therefore a law of conservation exists for the mass and charge of a system, too. As an example for the calculation of a particle current, we take a plane wave $\psi = A\exp(i\boldsymbol{k}\cdot\boldsymbol{x})$. From (6.21) we get

$$\boldsymbol{j} = |A|^2\,\frac{\hbar\boldsymbol{k}}{m} = \psi^*\psi\,\frac{\boldsymbol{p}}{m} = w\,\boldsymbol{v} . \tag{6.25}$$

The close relation between particle current $\boldsymbol{j}$ and velocity $\boldsymbol{v}$ is immediately evident. Of course, the current through an arbitrary distant surface is not zero in this case, so the functions are not normalizable in this way; the plane waves have to be normalized to $\delta$-functions according to Chap. 5.


6.2 Stationary States 


As we recall, in the case of a not explicitly time-dependent $\hat{H}$, we were able to separate the variables $x$ and $t$ of the time-dependent Schrödinger equation,

$$i\hbar\,\frac{\partial}{\partial t}\psi(x, t) = \hat{H}\,\psi(x, t) . \tag{6.26}$$

With $\psi(x, t) = \psi(x) f(t)$ we got two differential equations,

$$i\hbar\,\frac{\partial}{\partial t} f_n(t) = E_n f_n(t) , \qquad \hat{H}\psi_n(x) = E_n\psi_n(x) . \tag{6.27}$$

From the first equation we obtained the time factor $f_n(t) = \exp[-i(E_n/\hbar)t]$, which was normalized in such a way that $|f_n|^2 = 1$. The second equation in (6.27) is the stationary Schrödinger equation. With $E_n = \hbar\omega_n$, we have for the eigenfunctions of $\hat{H}$:

$$\psi_n(x, t) = \psi_n(x)\,e^{-i\omega_n t} . \tag{6.28}$$

The general solution $\Psi(x, t)$ of the time-dependent Schrödinger equation is a superposition of all $\psi_n(x, t)$:

$$\Psi(x, t) = \sum_n C_n(0)\,\psi_n(x, t) = \sum_n C_n(t)\,\psi_n(x) \quad\text{with}\quad C_n(t) = C_n(0)\,e^{-i\omega_n t} . \tag{6.29}$$

The coefficients $C_n$ are determined by the integral

$$C_n(0) = \int \Psi(x, 0)\,\psi_n^*(x)\,dx . \tag{6.30}$$

To prove this, let us first consider (6.29) at time $t = 0$:

$$\Psi(x, 0) = \sum_n C_n(0)\,\psi_n(x) . \tag{6.31}$$

Since the wave functions $\psi_n(x)$ are orthonormal, i.e. $\langle\psi_m|\psi_n\rangle = \delta_{mn}$, we can multiply both sides by $\psi_m^*(x)$ and integrate. This gives us

$$\int \Psi(x, 0)\,\psi_m^*(x)\,dx = \sum_n C_n(0)\int \psi_n(x)\,\psi_m^*(x)\,dx = \sum_n C_n(0)\,\delta_{nm} = C_m(0) .$$

This is precisely result (6.30); in particular, we refer to the analogy of the expansion (6.29) with the decomposition of an arbitrary vector $\boldsymbol{A}$ in terms of an orthonormalized basis $\boldsymbol{e}_i$:

$$\boldsymbol{A} = \sum_i a_i\,\boldsymbol{e}_i ,$$

where the components (coefficients of expansion)

$$a_i = \boldsymbol{A}\cdot\boldsymbol{e}_i$$

are scalar products of the vector $\boldsymbol{A}$ and the basis vectors $\boldsymbol{e}_i$. According to this, we can view (6.29) as a decomposition of a state $\Psi(x, t)$ in terms of the basis $\psi_n(x)$. The coefficients of expansion $C_n(t)$ are consequently the components of the state $\Psi(x, t)$ in terms of the basis vectors $\psi_n(x)$.
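The projection formula for the expansion coefficients can be tried out on a concrete orthonormal basis. The sketch below assumes, purely for illustration, the real eigenfunctions $\sqrt{2/L}\,\sin(n\pi x/L)$ of an infinitely deep well of width $L = 1$ and an initial state built from the two lowest ones; all names are hypothetical.

```python
import math

L = 1.0          # well width (assumed for the illustration)
N_GRID = 2000    # number of midpoint-rule cells


def psi(n, x):
    """Orthonormal infinite-well eigenfunction sqrt(2/L) * sin(n*pi*x/L)."""
    return math.sqrt(2.0 / L) * math.sin(n * math.pi * x / L)


def overlap(m, state):
    """C_m(0) = integral of state(x) * psi_m(x) dx (psi_m is real here)."""
    h = L / N_GRID
    total = 0.0
    for i in range(N_GRID):
        x = (i + 0.5) * h
        total += state(x) * psi(m, x)
    return total * h


def initial_state(x):
    """Normalized superposition 0.6*psi_1 + 0.8*psi_2 at t = 0."""
    return 0.6 * psi(1, x) + 0.8 * psi(2, x)


coeffs = [overlap(m, initial_state) for m in (1, 2, 3)]
print(coeffs)
```

The projection returns the coefficients that were put into the superposition and gives zero for a basis state that is not contained in it, just as the scalar product $\boldsymbol{A}\cdot\boldsymbol{e}_i$ returns the component $a_i$.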


6.3 Properties of Stationary States 


It holds that $\psi_n^*(x, t)\,\psi_n(x, t) = \psi_n^*(x)\,\psi_n(x)$, since the time factor is normalized. Therefore, the probability density is constant in time for stationary states:

$$w_n(x, t) = w_n(x) . \tag{6.32}$$

The current $\boldsymbol{j}(x, t)$ is given by (6.21):

$$\boldsymbol{j}_n(x, t) = \frac{i\hbar}{2m}\left[\psi_n(x, t)\nabla\psi_n^*(x, t) - \psi_n^*(x, t)\nabla\psi_n(x, t)\right] . \tag{6.33}$$

Since the nabla operator does not affect the time factor, we also have

$$\boldsymbol{j}_n(x, t) = \boldsymbol{j}_n(x) , \tag{6.34}$$

i.e. the current of stationary states is constant in time as well. Now we can expand $\psi_n(x, t)$ in terms of eigenfunctions of any operator $\hat{A}$:

$$\psi_n(x, t) = \sum_A C_A(t)\,\psi_A(x) .$$

For stationary states, the probabilities $|C_A|^2$ for finding the value $A$ of the observable described by the operator $\hat{A}$ are time independent if $\hat{A}$ is not explicitly time dependent, since

$$C_A(t) = \int \psi_A^*(x)\,\psi_n(x, t)\,dx = e^{-i\omega_n t}\int \psi_A^*(x)\,\psi_n(x)\,dx ,$$

where for $t = 0$

$$C_A(0) = \int \psi_A^*(x)\,\psi_n(x)\,dx .$$

From these two equations the statement

$$|C_A(t)|^2 = |C_A(0)|^2 \tag{6.35}$$

follows.


RRS ——————E———————— See 
6.8 Current Density of a Spherical Wave 


The spherical wave 
+iker 
y= 5 kak- 


r r 


is given. 

Problem. a) Calculate the probability current density j for the wave function. 
b) With k = k(r/r) calculate the number of particles that flow per s through 

a sphere of radius r. What physical processes are described here by wy? 


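Solution. a) The probability current density of a wave function is defined as

$$\mathbf{j} = \frac{\hbar}{2mi}\left(\psi^*\nabla\psi - \psi\nabla\psi^*\right) = \frac{\hbar}{m}\,\mathrm{Im}\{\psi^*\nabla\psi\} .$$

For the wave function

$$\psi = \exp(\pm i\mathbf{k}\cdot\mathbf{r})/r ,$$

the gradient is

$$\nabla\psi = \nabla\,\frac{\exp(\pm i\mathbf{k}\cdot\mathbf{r})}{r} = \frac{1}{r}\nabla\exp(\pm i\mathbf{k}\cdot\mathbf{r}) + \exp(\pm i\mathbf{k}\cdot\mathbf{r})\,\nabla(1/r) = \left(\pm i\mathbf{k} - \mathbf{r}/r^2\right)\exp(\pm i\mathbf{k}\cdot\mathbf{r})/r .$$

From this we have for the current density, with the help of $\mathbf{p} = \hbar\mathbf{k}$ and $\mathbf{v} = \mathbf{p}/m$:

$$\mathbf{j} = \frac{\hbar}{m}\,\mathrm{Im}\left\{\frac{\pm i\mathbf{k} - \mathbf{r}/r^2}{r^2}\right\} = \pm\frac{\hbar\mathbf{k}}{mr^2} = \pm\frac{\mathbf{p}}{mr^2} = \pm\frac{\mathbf{v}}{r^2} .$$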


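b) The number of particles per second streaming through a unit surface is given by

$$N = \mathbf{j}\cdot\mathbf{n} = \pm\frac{\mathbf{v}}{r^2}\cdot\frac{\mathbf{r}}{r} = \pm\frac{v}{r^2} ,$$

where $\mathbf{n} = \mathbf{r}/r$ is the unit normal of the sphere and $\mathbf{v} = \hbar\mathbf{k}/m$ is parallel to $\mathbf{r}$. Thus

$$N\cdot 4\pi r^2 = \pm 4\pi v$$

expresses the number of particles passing through the surface of the whole sphere.

Depending on the sign, the wave function describes either an emission or absorption process.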
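
The statement that the total flux through the sphere does not depend on its radius can be checked numerically. The following is a minimal sketch, assuming natural units ħ = m = 1 and k parallel to r (both are assumptions made for the illustration, and the function names are hypothetical). It evaluates the radial current j_r = (ħ/m) Im(ψ* ∂ψ/∂r) with a central difference and multiplies by the surface 4πr².

```python
import cmath
import math

HBAR = 1.0  # natural units, an assumption for this sketch
MASS = 1.0


def psi(r, k, sign=+1):
    """Spherical wave exp(sign*i*k*r)/r for k parallel to r."""
    return cmath.exp(sign * 1j * k * r) / r


def radial_current(r, k, sign=+1, h=1e-6):
    """j_r = (hbar/m) Im(psi^* dpsi/dr), derivative by central difference."""
    dpsi = (psi(r + h, k, sign) - psi(r - h, k, sign)) / (2.0 * h)
    return HBAR / MASS * (psi(r, k, sign).conjugate() * dpsi).imag


def flux_through_sphere(r, k, sign=+1):
    """Particles per unit time through a sphere of radius r."""
    return 4.0 * math.pi * r**2 * radial_current(r, k, sign)


if __name__ == "__main__":
    for radius in (1.0, 2.0, 5.0):
        # The analytic result is sign * 4*pi*hbar*k/m = sign * 4*pi*v.
        print(radius, flux_through_sphere(radius, k=2.0))
```

The printed flux should be the same for every radius and change sign with `sign`, which corresponds to emission (+) or absorption (−).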


[Figure: Emission of a spherical wave. The vectors illustrate the radially directed phase velocity of the wave.]


6.9 A Particle in a Periodic Potential

Let $V(x)$ be a periodic potential with $V(x + a) = V(x)$.

Problem. a) Show that the Hamiltonian

$$\hat{H} = -\frac{\hbar^2}{2m}\frac{d^2}{dx^2} + V(x) \tag{1}$$

commutes with the translation operator $\hat{T}(a)$, which has the property

$$\hat{T}(a)\,\psi(x) = \psi(x + a) . \tag{2}$$

b) Deduce from the periodicity of the potential that the wave function has the form

$$\psi_k(x) = e^{ikx}\,\phi_k(x) \quad \text{(Bloch functions)}, \tag{3}$$

where $k \in \mathbb{R}$ and $\phi_k(x + a) = \phi_k(x)$. Suggestion: For two linearly independent solutions $\psi_1(x)$, $\psi_2(x)$ of the energy eigenvalue $E$, which are simultaneously eigenfunctions of $\hat{T}(a)$, we have

$$W(x) = \begin{vmatrix} \psi_1(x) & \psi_2(x) \\ \psi_1'(x) & \psi_2'(x) \end{vmatrix} = \text{const} . \tag{4}$$


c) Discuss the eigenvalue condition and determine the allowed range of energy in the potential

$$V(x) = -V_0\sum_{n=-\infty}^{+\infty}\delta(x + na) , \quad V_0 > 0 , \quad n \in \mathbb{Z} . \tag{5}$$

[Kronig-Penney model of the energy levels (band structure) in solids.]


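Solution. a) The translation of a wave function $\psi(x)$ by $a$ is given by

$$\psi(x + a) = \sum_{n=0}^{\infty}\frac{a^n}{n!}\frac{d^n}{dx^n}\psi(x) = \exp\left(a\frac{d}{dx}\right)\psi(x) = \hat{T}(a)\,\psi(x) . \tag{6}$$

Here Taylor's formula has been used to expand $\psi(x + a)$. Obviously, the translation operator is $\hat{T}(a) = \exp(i\hat{p}a/\hbar)$ with the momentum operator $\hat{p} = -i\hbar(d/dx)$ and

$$-\frac{\hbar^2}{2m}\frac{d^2}{dx^2} = \frac{\hat{p}^2}{2m} .$$

From this we get directly

$$\left[\hat{T}(a), \frac{\hat{p}^2}{2m}\right] = 0 . \tag{8}$$

Besides,

$$[\hat{T}(a), V(x)]\,\psi(x) = \hat{T}(a)V(x)\psi(x) - V(x)\hat{T}(a)\psi(x) = V(x + a)\psi(x + a) - V(x)\psi(x + a) = 0 , \tag{9}$$

because $V(x + a) = V(x)$, since periodicity is assumed.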

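b) The one-dimensional Schrödinger equation is an ordinary differential equation of second order. Therefore it possesses two linearly independent solutions for each eigenvalue $E$: $\psi_E^{(1)}$ and $\psi_E^{(2)}$. Since $[\hat{T}(a), \hat{H}] = 0$, we can choose $\psi_E^{(1)}$, $\psi_E^{(2)}$ to be simultaneous eigenfunctions of $\hat{T}(a)$. So we have

$$\hat{T}(a)\,\psi_E^{(i)}(x) = \psi_E^{(i)}(x + a) = \lambda_i\,\psi_E^{(i)}(x) , \quad i = 1, 2 . \tag{10}$$

The $\lambda_i$ are constant numbers that depend on the energy. Since $\hat{H}\psi_E^{(i)} = E\psi_E^{(i)}$, we get for the Wronskian (4) of $\psi_E^{(1)}$ and $\psi_E^{(2)}$

$$\frac{d}{dx}W(x) = \frac{d}{dx}\left[\psi_E^{(1)}\psi_E^{(2)\prime} - \psi_E^{(2)}\psi_E^{(1)\prime}\right] = \psi_E^{(1)}\psi_E^{(2)\prime\prime} - \psi_E^{(2)}\psi_E^{(1)\prime\prime} = \psi_E^{(1)}\frac{2m}{\hbar^2}\bigl(V(x) - E\bigr)\psi_E^{(2)} - \psi_E^{(2)}\frac{2m}{\hbar^2}\bigl(V(x) - E\bigr)\psi_E^{(1)} = 0 . \tag{11}$$

Hence $W(x) = \text{const}$. Since $W(x) = W(x + a) = \lambda_1\lambda_2 W(x)$ is also valid, it follows that $\lambda_1\lambda_2 = 1$. Besides,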


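$$\psi_E^{(i)*}(x + a) = \hat{T}(a)\,\psi_E^{(i)*}(x) = \lambda_i^*\,\psi_E^{(i)*}(x) , \tag{12}$$

i.e. the $\lambda_i^*$ are also eigenvalues of $\hat{T}(a)$. But they cannot differ from the $\lambda_i$, because $\psi_E^{(1)}$ and $\psi_E^{(2)}$ are linearly independent; i.e. either

$$\psi_E^{(1)*} \propto \psi_E^{(1)} , \quad \psi_E^{(2)*} \propto \psi_E^{(2)} , \quad \lambda_1^* = \lambda_1 , \quad \lambda_2^* = \lambda_2 , \quad \text{i.e.} \quad \lambda_i \in \mathbb{R} , \tag{13}$$

or

$$\psi_E^{(1)*} \propto \psi_E^{(2)} , \quad \psi_E^{(2)*} \propto \psi_E^{(1)} , \quad \lambda_1^* = \lambda_2 = \frac{1}{\lambda_1} , \quad \text{i.e.} \quad |\lambda_i| = 1 . \tag{14}$$

If, in the first case, we assume without restriction of generality that $|\lambda_1| > 1$, then $\psi_E^{(1)}$ cannot be square integrable, because $\psi_E^{(1)}(x + na) = (\lambda_1)^n\,\psi_E^{(1)}(x)$ due to (10) and $\psi_E^{(1)}(x)$ will increase ad infinitum for $x \to \infty$. Therefore the second case must be true:

$$\lambda_1 = e^{i\alpha_E} , \quad \lambda_2 = e^{-i\alpha_E} , \quad \alpha_E \in \mathbb{R} \tag{15}$$

($\alpha_E = 0$ includes the case $\lambda_i = 1$). If we define $k_E = \alpha_E/a$, we get

$$\psi_E^{(1)}(x + a) = e^{ik_E a}\,\psi_E^{(1)}(x) \quad \text{and} \quad \psi_E^{(2)}(x + a) = e^{-ik_E a}\,\psi_E^{(2)}(x) . \tag{16}$$

In the decomposition

$$\psi_E^{(1)}(x) = e^{ik_E x}\,\phi_E^{(1)}(x) \tag{17}$$

we must have

$$\phi_E^{(1)}(x + a) = \phi_E^{(1)}(x) , \tag{18}$$

since

$$e^{ik_E a}\,e^{ik_E x}\,\phi_E^{(1)}(x) = e^{ik_E a}\,\psi_E^{(1)}(x) = \psi_E^{(1)}(x + a) = e^{ik_E a}\,e^{ik_E x}\,\phi_E^{(1)}(x + a)$$

(for $i = 2$, the proof is analogous). In general, we suppress the index $E$ of the wave function and write

$$\psi_{\pm k}(x) = e^{\pm ikx}\,\phi_{\pm k}(x) , \tag{19}$$

where $\psi_k$ and $\psi_{-k}$ are linearly independent and $\phi_k$ is periodic.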
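c) In the range $0 < x < a$, it holds that $V(x) = 0$, and therefore

$$\psi(x) = Ae^{i\kappa x} + Be^{-i\kappa x}$$

with

$$\kappa^2 = 2mE/\hbar^2 , \quad x \in (0, a) . \tag{20}$$

But now we have, because of (16),

$$\psi(x) = e^{ika}\,\psi(x - a) ; \tag{21}$$

therefore with (20) we must have in the interval $x \in (a, 2a)$

$$\psi(x) = e^{ika}\left[Ae^{i\kappa(x - a)} + Be^{-i\kappa(x - a)}\right] . \tag{22}$$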


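The wave function $\psi$, but not the derivative of $\psi$, is continuous at $a$, as can be seen by integrating the Schrödinger equation from $a - \varepsilon$ to $a + \varepsilon$:

$$0 = \int_{a-\varepsilon}^{a+\varepsilon} dx\left[E\psi(x) + \frac{\hbar^2}{2m}\psi''(x) - V(x)\psi(x)\right] = \int_{a-\varepsilon}^{a+\varepsilon} dx\left[E\psi(x) + \frac{\hbar^2}{2m}\psi''(x) + V_0\,\delta(x - a)\,\psi(x)\right] .$$

From this follows, since the contribution of $E\psi$ vanishes for small $\varepsilon$,

$$\frac{\hbar^2}{2m}\,\psi'(x)\Big|_{a-\varepsilon}^{a+\varepsilon} + V_0\,\psi(a) = 0 \quad \text{or} \quad \frac{\hbar^2}{2m}\left[\psi'(a + \varepsilon) - \psi'(a - \varepsilon)\right] + V_0\,\psi(a) = 0 ,$$

which is written in the limit $\varepsilon \to 0$ as

$$\frac{\hbar^2}{2m}\left[\psi'(a + 0) - \psi'(a - 0)\right] + V_0\,\psi(a) = 0 . \tag{23}$$

The continuity of the wave function at $x = a$ yields

$$\psi(a - 0) = \psi(a + 0) . \tag{24}$$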
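With (22), (23) and (24) become

$$Ae^{i\kappa a} + Be^{-i\kappa a} = e^{ika}(A + B) ,$$

$$e^{ika}(i\kappa A - i\kappa B) - \left(i\kappa Ae^{i\kappa a} - i\kappa Be^{-i\kappa a}\right) + \frac{2m}{\hbar^2}V_0\left(Ae^{i\kappa a} + Be^{-i\kappa a}\right) = 0 , \tag{25}$$

or

$$\begin{pmatrix} e^{ika} - e^{i\kappa a} & e^{ika} - e^{-i\kappa a} \\ i\kappa\left(e^{ika} - e^{i\kappa a}\right) + (2m/\hbar^2)V_0\,e^{i\kappa a} & -i\kappa\left(e^{ika} - e^{-i\kappa a}\right) + (2mV_0/\hbar^2)\,e^{-i\kappa a} \end{pmatrix}\begin{pmatrix} A \\ B \end{pmatrix} = 0 . \tag{26}$$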


The vanishing of the determinant leads to the eigenvalue equation

$$\cos ka = \cos\kappa a - \frac{amV_0}{\hbar^2}\,\frac{\sin\kappa a}{\kappa a} \equiv f(\kappa a) . \tag{27}$$

This equation relates $k$ of (16) and $E$. Instead of choosing $E$ and calculating $k$, we can also choose $k$ and calculate $E$ graphically. Since $|\cos ka| \le 1$, we have no solution of the eigenvalue equation for $|f(\kappa a)| > 1$.
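
The band condition $|f(\kappa a)| \le 1$ from (27) can also be explored numerically instead of graphically. The sketch below is only an illustration for the case $E > 0$: the dimensionless strength `P` stands for $amV_0/\hbar^2$, and the value `P = 3.0`, the grid step and the function names are arbitrary assumptions, not values from the text.

```python
import math


def f(x, P):
    """Right-hand side of (27) with x = kappa*a and P = a*m*V0/hbar^2."""
    return math.cos(x) - P * math.sin(x) / x


def allowed(x, P):
    """True if a real Bloch wave number k exists, i.e. |f| <= 1."""
    return abs(f(x, P)) <= 1.0


def allowed_bands(P, x_max, dx=1e-3):
    """Scan kappa*a on a grid; return (start, end) intervals with |f| <= 1."""
    bands, start, last = [], None, None
    for i in range(1, int(x_max / dx) + 1):
        x = i * dx
        if allowed(x, P):
            if start is None:
                start = x
            last = x
        elif start is not None:
            bands.append((start, last))
            start = None
    if start is not None:
        bands.append((start, last))
    return bands


if __name__ == "__main__":
    for lo, hi in allowed_bands(P=3.0, x_max=10.0):
        # Energies follow from kappa via E = hbar^2 kappa^2 / (2m), see (20).
        print(f"allowed: {lo:.3f} <= kappa*a <= {hi:.3f}")
```

For this choice of `P`, each allowed interval begins at a multiple of π and ends slightly before the next multiple, in line with the shifted solutions of (33) discussed below.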


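1st case. $E < 0$ (bound states),

$$\kappa = i\beta , \quad \beta \in \mathbb{R} , \quad \beta^2 = 2m|E|/\hbar^2 . \tag{28}$$

Now we have $\sin i\beta a = i\sinh\beta a$, $\cos i\beta a = \cosh\beta a$ and

$$f(i\beta a) = \cosh\beta a - \frac{amV_0}{\hbar^2}\,\frac{\sinh\beta a}{\beta a} \tag{29}$$

is a steeply and monotonically increasing, even function that exceeds 1 at $\beta_0 a$ (see next figure). Therefore

$$|E| = \frac{\hbar^2\beta^2}{2m} \le \frac{\hbar^2\beta_0^2}{2m} \equiv E_0 \tag{30}$$

or, because $E$ is assumed to be negative, $E \ge -E_0$. This is illustrated in the last figure below.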


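2nd case. $E > 0$, $\kappa \in \mathbb{R}$.

According to (27) $f(\kappa a)$ is even and equal to 1 at $\kappa a = x$ with

$$\cos x - \frac{amV_0}{\hbar^2}\,\frac{\sin x}{x} = 1 , \tag{31}$$

$$-\frac{amV_0}{\hbar^2 x}\,2\sin\frac{x}{2}\cos\frac{x}{2} = 2\sin^2\frac{x}{2} . \tag{32}$$

This is fulfilled for

$$\sin\frac{x}{2} = 0 \quad \text{or} \quad \tan\frac{x}{2} = -\frac{amV_0}{2\hbar^2}\,\frac{1}{x/2} , \tag{33}$$

i.e. for $(x_1)_n = 2n\pi$ and $(x_2)_n = 2n\pi - \Delta(n\pi)$, where $n \in \mathbb{N}$ and $\lim_{n\to\infty}\Delta(n\pi) = 0$. Analogously we find the points at which $f(\kappa a)$ is equal to $-1$ at $(x_1')_n = (2n - 1)\pi$ and $(x_2')_n = (2n - 1)\pi - \Delta[(2n - 1)\pi]$. Between $(x_2)_n$ and $(x_1)_n$, or $(x_2')_n$ and $(x_1')_n$, there are no allowed energy eigenvalues, as is obvious from the above figure.

[Figure legend: × solution of (33); • zero of $\sin(x/2)$]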

The graphic representation of the energy dependence on the wave number $k$ is characterized by "forbidden regions" that shrink for increasing $k$. If $f(n\pi) = (-1)^n$, then we obviously have $ka = n\pi$ for $\cos(ka) = (-1)^n$, i.e. we have "energy gaps" at those places (see figure above). Therefore the spectrum falls apart into "allowed" energy regions (named energy bands) and "forbidden" energy regions (gaps) (see adjacent figure). Energy bands play an important role in the motion of electrons in periodic structures in solid state physics (conduction bands, valence bands).

[Figure: As soon as the function $f(i\beta a) > 1$, (27) has no solutions.]

[Figure: Graphical display of the solutions of (33): they are indicated by × and are shifted by $-\Delta(n\pi)$ from the zeros of the function $\sin(x/2)$, which are located at $x/2 = n\pi$, $n = 0, 1, 2, \ldots$]

[Figure: Allowed and forbidden regions.]

[Figure: Energy bands with intervening energy gaps.]


6.4 Biographical Notes


SCHRÖDINGER, Erwin, Austrian physicist, *Vienna 12.8.1887, †Alpbach (Tirol) 4.1.1961, was a student of F. Hasenöhrl. As a professor in Zürich, S. worked on statistical thermodynamics, the theory of general relativity and the theory of colour vision. Excited by L. de Broglie's Ph.D. thesis and A. Einstein's publications concerning Bose statistics, S. created wave mechanics. In December 1925 he defined the Klein-Gordon equation and later, in January 1926, he invented the Schrödinger equation, which describes, in nonrelativistic approximation, the atomic eigenvalues. In March 1926, S. proved the mathematical equivalence of his theory with matrix mechanics (M. Born, W. Heisenberg and P. Jordan). S. always attacked the statistical interpretation of quantum theory (as did Einstein, von Laue and de Broglie), especially the "Copenhagen interpretation". In 1927 S. went to Berlin as Planck's successor and emigrated in 1933, as a convinced liberal, to Oxford. In the same year he was awarded, together with P.A.M. Dirac, the Nobel Prize in Physics. In 1936 S. went to the University of Graz, Austria, emigrating a second time when Austria was annexed. The Institute for Advanced Studies was founded in Dublin for him and others. In 1956, S. returned to Austria.


BOSE, Satyendra Nath, Indian physicist, *1.1.1894, †4.2.1974 Calcutta. Together with
Einstein, he set up a theory of quantum statistics (Bose-Einstein statistics) that differs 
from the classical Boltzmann statistics and from Fermi statistics, too. B. invented this 
statistics for photons; Einstein extended it to massive particles. B. was a professor in 
Dacca and Calcutta from 1926 to 1956. 


BOLTZMANN, Ludwig, Austrian physicist, *Vienna 1844, †Duino near Trieste 1906. He studied physics at the university of Vienna where he was an assistant of Josef Stefan. B. became professor of mathematical physics at the university of Graz in 1869. He also taught at Vienna, Munich and Leipzig. Among his students were S. Arrhenius, W. Nernst, F. Hasenöhrl and L. Meitner. The young B. worked successfully on experimental physics
(he proved the relationship between the refractive index and the dielectric constant for 
sulphur, which was required by Maxwell). Near the end of his life he occupied his mind 
with philosophical issues, but his main interest was always theoretical physics. 

The central problem of his life’s theoretical work was the reduction of thermo- 
dynamics to mechanics, requiring the solution of the contradiction between the revers- 
ibility of mechanical processes and the irreversibility of thermodynamical processes. He 
showed the relationship between the entropy S and the probability of a state W with the 
formula S = k In W (k: Boltzmann’s constant). This was the starting point of quantum 
theory both in the formulation of Max Planck in 1900 and in the expanded version of 
Albert Einstein (1905). Other important achievements of B. are the formulas for the 
energy distribution of atoms moving freely or in force fields (Maxwell-Boltzmann 
distribution) and the theoretical explanation of the law of the radiation power of a black 
body (Stefan-Boltzmann law, 1884). 

B. was an exponent of the atom theory. The small response and even rejection that he 
received for it from many contemporary physicists disappointed him throughout his 
lifetime. He did not live to see the final victory of the atom theory introduced in 1905 by 
Einstein’s theory of Brownian motion. 

B. committed suicide at the age of 62. 


FERMI, Enrico, Italian physicist, *29.9.1901, Rome, †28.11.1954, Chicago. F. was a professor in Florence and Rome before going to Columbia University in New York in 1939. There he stayed until 1946, when he went to Chicago. F. was mainly engaged in quantum mechanics. He discovered the conversion of nuclei by neutron bombardment, and, beginning in 1934, was thus able to create many new synthetic radioactive substances that he thought were transuranic. F. formulated the statistics named after him (Fermi statistics) in his treatise "Sulla quantizzazione del gas perfetto monoatomico" (Lincei Rendiconti 1926; Zeitschrift für Physik 1926). In 1938 he was awarded the Nobel Prize in Physics. During World War II, F. was substantially engaged in projects devoted to making use of atomic energy. Under his guidance the first nuclear chain reaction was performed at the Chicago nuclear reactor on 2.12.1942. In memory of F., the Enrico Fermi Prize was established in the United States.


7. The Harmonic Oscillator


As an application of the Schrödinger equation, we now calculate the states of a particle in an oscillator potential. From classical mechanics we know that such a potential is of great importance, because many complicated potentials can be approximated in the vicinity of their equilibrium points by a harmonic oscillator. Expanding a potential $V(x)$ in one dimension in a Taylor series yields:

$$V(x) = V(a + (x - a)) = V(a) + V'(a)(x - a) + \tfrac{1}{2}V''(a)(x - a)^2 + \cdots . \tag{7.1}$$

If a stable equilibrium exists for $x = a$, $V(x)$ has a minimum at $x = a$, i.e. $V'(a) = 0$ and $V''(a) > 0$. We can choose $a$ as the origin of the coordinate system and set $V(a) = 0$; then an oscillator potential is indeed a first approximation in the vicinity of $x = a$, i.e. in the vicinity of the equilibrium point.
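
As an illustration of (7.1), the following sketch compares a potential with its harmonic approximation near the minimum. The Morse-type potential and the parameter values `D`, `ALPHA` and `A` are assumptions chosen only for the example; they do not come from the text. The curvature $V''(a)$ is estimated with a central difference.

```python
import math

# Hypothetical Morse-type potential V(x) = D (1 - exp(-alpha (x - a)))^2,
# chosen only as an example of a potential with a minimum at x = a.
D, ALPHA, A = 1.0, 1.0, 0.0


def morse(x):
    return D * (1.0 - math.exp(-ALPHA * (x - A))) ** 2


def second_derivative(func, x, h=1e-4):
    """Central-difference estimate of func''(x)."""
    return (func(x + h) - 2.0 * func(x) + func(x - h)) / h**2


def harmonic(x, func=morse, x0=A):
    """Quadratic term of the Taylor series (7.1) around the minimum x0."""
    return func(x0) + 0.5 * second_derivative(func, x0) * (x - x0) ** 2


if __name__ == "__main__":
    for x in (0.01, 0.05, 0.2, 1.0):
        print(x, morse(x), harmonic(x))
```

Close to the minimum the two columns agree well; further away the neglected higher-order terms of the series become visible.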

In the following we shall consider the one-dimensional case. Then the classical Hamiltonian function of a particle with mass $m$ oscillating with frequency $\omega$ takes the form

$$H = \frac{p^2}{2m} + \frac{m}{2}\omega^2 x^2 , \tag{7.2}$$

and the corresponding quantum-mechanical Hamiltonian reads

$$\hat{H} = -\frac{\hbar^2}{2m}\frac{d^2}{dx^2} + \frac{m}{2}\omega^2 x^2 . \tag{7.3}$$

Since the potential is constant in time, the time-independent (stationary) Schrödinger equation determines the stationary solutions $\psi_n$ and the corresponding eigenvalues (energies) $E_n$. The stationary Schrödinger equation takes the form

$$-\frac{\hbar^2}{2m}\frac{d^2}{dx^2}\psi(x) + \frac{m}{2}\omega^2 x^2\,\psi(x) = E\,\psi(x) . \tag{7.4}$$

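Because of the importance of the harmonic oscillator and its solutions for quantum mechanics, we will now consider the method of solving this differential equation in detail. Using the abbreviations

$$k^2 = \frac{2m}{\hbar^2}E , \quad \lambda = \frac{m\omega}{\hbar} , \tag{7.5}$$

we can rewrite the differential equation as

$$\frac{d^2\psi}{dx^2} + \left(k^2 - \lambda^2 x^2\right)\psi = 0 . \tag{7.6}$$

Equation (7.6) is known as Weber's differential equation. For further simplification, we introduce the transformation

$$y = \lambda x^2 , \tag{7.7}$$

and obtain

$$y\frac{d^2\psi}{dy^2} + \frac{1}{2}\frac{d\psi}{dy} + \left(\frac{\kappa}{2} - \frac{y}{4}\right)\psi = 0 , \tag{7.8}$$

with

$$\kappa = \frac{k^2}{2\lambda} = \frac{\hbar k^2}{2m\omega} = \frac{E}{\hbar\omega} . \tag{7.9}$$

To rewrite (7.8) in standard form, we split off the asymptotic solution. The latter can be inferred by examining the dominant behaviour in terms linear in $y$ for the asymptotic region $y \to \infty$. Hence, we try writing

$$\psi(y) = e^{-y/2}\,\varphi(y) . \tag{7.10}$$

Using

$$\frac{d\psi}{dy} = e^{-y/2}\left(\frac{d\varphi}{dy} - \frac{1}{2}\varphi\right) \quad \text{and} \quad \frac{d^2\psi}{dy^2} = e^{-y/2}\left(\frac{d^2\varphi}{dy^2} - \frac{d\varphi}{dy} + \frac{1}{4}\varphi\right) ,$$

the differential equation for $\varphi(y)$ follows from (7.8):

$$y\frac{d^2\varphi}{dy^2} + \left(\frac{1}{2} - y\right)\frac{d\varphi}{dy} + \left(\frac{\kappa}{2} - \frac{1}{4}\right)\varphi = 0 . \tag{7.11}$$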


Before further examining (7.11), we shall digress to the field of hypergeometric 
functions. Our aim is to understand the basic mathematical features as well 
as possible without going into rigorous derivations; a heuristic treatment will 
suffice. 




7.1 Mathematical Supplement: Hypergeometric Functions 


The Hypergeometric Differential Equation 
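The hypergeometric differential equation, expressed by C.F. Gauß in the form

$$z(1 - z)\frac{d^2\varphi}{dz^2} + \left[c - (a + b + 1)z\right]\frac{d\varphi}{dz} - ab\,\varphi = 0 , \tag{1}$$

contains the three free parameters $a$, $b$, $c$ and possesses a great variety of solutions. It has three nonessential singularities at $z = 0, 1, \infty$. To solve (1), we substitute the power series

$$\varphi(z) = z^{\sigma}\sum_{\nu=0}^{\infty} c_\nu z^{\nu}$$

into the differential equation (1) and find the recurrence relation

$$z(1 - z)\,z^{\sigma}\sum_{\nu=0}^{\infty} c_\nu(\nu + \sigma)(\nu + \sigma - 1)z^{\nu-2} + \left[c - (a + b + 1)z\right]z^{\sigma}\sum_{\nu=0}^{\infty} c_\nu(\nu + \sigma)z^{\nu-1} - ab\,z^{\sigma}\sum_{\nu=0}^{\infty} c_\nu z^{\nu} = 0 . \tag{2}$$

Multiplying out the factors and re-ordering the terms yields

$$c_0\,\sigma(c + \sigma - 1)z^{\sigma-1} + \sum_{\nu=0}^{\infty}\left[c_{\nu+1}(\nu + \sigma + 1)(\nu + c + \sigma) - c_\nu(\nu + a + \sigma)(\nu + b + \sigma)\right]z^{\nu+\sigma} = 0 . \tag{3}$$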
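For this expression to vanish identically, all the coefficients have to be equal to zero, i.e.

$$\sigma(c - 1 + \sigma) = 0 \quad \text{(the "index equation") and} \tag{4}$$

$$c_{\nu+1} = \frac{(\nu + a + \sigma)(\nu + b + \sigma)}{(\nu + 1 + \sigma)(\nu + c + \sigma)}\,c_\nu . \tag{5}$$

One solution of (1) (if we set $c_0 = 1$) is therefore given by

$$\varphi(z) = z^{\sigma}\sum_{\nu=0}^{\infty}\frac{(a + \sigma)_\nu\,(b + \sigma)_\nu}{(1 + \sigma)_\nu\,(c + \sigma)_\nu}\,z^{\nu} , \tag{6}$$

using the abbreviations (Pochhammer symbols)

$$(a)_\nu = a(a + 1)\cdots(a + \nu - 1) , \quad (a)_0 = 1 . \tag{7}$$

The radius of convergence can be inferred from the ratio test for convergence,

$$r = \lim_{\nu\to\infty}\left|\frac{c_\nu}{c_{\nu+1}}\right| = 1 . \tag{8}$$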

The index equation (4) yields two possible values for the exponent $\sigma$:

1) $\sigma = 0$. The solution in this case is the hypergeometric series

$$\varphi_1(z) = {}_2F_1(a, b; c; z) = \sum_{\nu=0}^{\infty}\frac{(a)_\nu\,(b)_\nu}{(c)_\nu}\,\frac{z^{\nu}}{\nu!} . \tag{9}$$

The indices appended to ${}_2F_1$ are related to a generalization of the hypergeometric series in the form

$${}_pF_q(a_1, \ldots, a_p; b_1, \ldots, b_q; z) = \sum_{\nu=0}^{\infty}\frac{(a_1)_\nu\cdots(a_p)_\nu}{(b_1)_\nu\cdots(b_q)_\nu}\,\frac{z^{\nu}}{\nu!} . \tag{10}$$

The solution (9) only makes sense if, in the series ${}_2F_1$, none of the denominators of the various terms of the series vanishes, i.e. the existence of ${}_2F_1$ implies the condition that $c \ne -n$, where $n = 0, 1, \ldots$. Then the series is holomorphic in the unit circle. When $a = -n$ or $b = -n$, the series terminates and defines a polynomial of $n$th degree. For example,

$${}_2F_1(-n, n + 1; 1; z) = P_n(1 - 2z) \tag{11}$$

is a Legendre polynomial (see Examples 4.8–10). Further special cases are, among others, the Gegenbauer and Tschebycheff polynomials.¹
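
The termination of the series (9) for $a = -n$ and the special case (11) can be checked with a few lines of code. The sketch below sums the series with the coefficient recurrence (5) for $\sigma = 0$ and compares it with Legendre polynomials generated by the standard three-term recurrence $(n+1)P_{n+1}(x) = (2n+1)xP_n(x) - nP_{n-1}(x)$. The function names and the cut-off parameters are illustrative choices, not part of the text.

```python
def hyp2f1_series(a, b, c, z, max_terms=200, tol=1e-15):
    """Sum the hypergeometric series (9) term by term.

    Consecutive terms follow from the recurrence (5) with sigma = 0.
    If a or b is a non-positive integer, a term becomes exactly zero
    and the series terminates as a polynomial.
    """
    term, total = 1.0, 1.0
    for nu in range(max_terms):
        term *= (a + nu) * (b + nu) / ((c + nu) * (nu + 1)) * z
        if term == 0.0:
            break
        total += term
        if abs(term) < tol * abs(total):
            break
    return total


def legendre(n, x):
    """Legendre polynomial P_n(x) from the standard three-term recurrence."""
    p_prev, p = 1.0, x
    if n == 0:
        return p_prev
    for k in range(1, n):
        p_prev, p = p, ((2 * k + 1) * x * p - k * p_prev) / (k + 1)
    return p


if __name__ == "__main__":
    for n in range(4):
        z = 0.25
        print(n, hyp2f1_series(-n, n + 1, 1, z), legendre(n, 1 - 2 * z))
```

Both columns should coincide for every $n$, which is the content of relation (11).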

2) $\sigma = 1 - c$. According to (6) and (9), the second solution may be expressed by the hypergeometric function with changed parameters, namely

$$\varphi_2(z) = z^{1-c}\,{}_2F_1(a + 1 - c, b + 1 - c; 2 - c; z) . \tag{12}$$

Note the factor $z^{1-c}$ before the hypergeometric function ${}_2F_1$. The solution $\varphi_2$ exists only if $c \ne 2, 3, 4, \ldots$.

The general solution of the hypergeometric differential equation is therefore

$$\varphi(z) = A\,{}_2F_1(a, b; c; z) + B\,z^{1-c}\,{}_2F_1(a + 1 - c, b + 1 - c; 2 - c; z) , \tag{13}$$

under the condition that $c$ is not a positive integer; otherwise there is only one single solution. The second independent fundamental solution then becomes more complicated.

For the analytic extension of the solution beyond its region of convergence we use the appropriate formula

$${}_2F_1(a, b; c; z) = \frac{\Gamma(c)\Gamma(b - a)}{\Gamma(b)\Gamma(c - a)}(-z)^{-a}\,{}_2F_1\left(a, 1 - c + a; 1 - b + a; \frac{1}{z}\right) + \frac{\Gamma(c)\Gamma(a - b)}{\Gamma(a)\Gamma(c - b)}(-z)^{-b}\,{}_2F_1\left(b, 1 - c + b; 1 - a + b; \frac{1}{z}\right) . \tag{14}$$

From this, the asymptotic behaviour for $|z| \to \infty$ follows:

$${}_2F_1(a, b; c; z) \to \frac{\Gamma(c)\Gamma(b - a)}{\Gamma(b)\Gamma(c - a)}(-z)^{-a} + \frac{\Gamma(c)\Gamma(a - b)}{\Gamma(a)\Gamma(c - b)}(-z)^{-b} . \tag{15}$$


The Confluent Hypergeometric Differential Equation 


By analytical continuation of the unit circle to the entire complex plane we may infer another important differential equation from (1). Substituting the linear transformation $x = bz$ into (1) leads to

$$x\left(1 - \frac{x}{b}\right)\frac{d^2\varphi}{dx^2} + \left[c - \frac{a + b + 1}{b}\,x\right]\frac{d\varphi}{dx} - a\,\varphi = 0 . \tag{16}$$

¹ See, for example, George Arfken: Mathematical Methods for Physicists, 2nd ed. (Academic Press, New York 1970) or Milton Abramowitz and I.A. Stegun: Handbook of Mathematical Functions (Dover Publ., New York 1972).

In the limit $b \to \infty$, we get the Kummer differential equation:

$$x\frac{d^2\varphi}{dx^2} + (c - x)\frac{d\varphi}{dx} - a\,\varphi = 0 . \tag{17}$$


This equation has a nonessential singularity at $x = 0$ and an essential one at $x = \infty$, which arises through the amalgamation (confluence) of $z = 1$ and $z = \infty$.

The general solution of (17) is obtained by again expanding in a power series around $x = 0$. Therefore we have

$$\varphi(x) = A\,{}_1F_1(a; c; x) + B\,x^{1-c}\,{}_1F_1(a - c + 1; 2 - c; x) , \tag{18}$$

with the confluent hypergeometric function

$${}_1F_1(a; c; x) = 1 + \frac{a}{c}\,\frac{x}{1!} + \frac{a(a + 1)}{c(c + 1)}\,\frac{x^2}{2!} + \cdots = \sum_{\nu=0}^{\infty}\frac{(a)_\nu}{(c)_\nu}\,\frac{x^{\nu}}{\nu!} . \tag{19}$$

The solution (18) originates from (13) in the limit $b \to \infty$, with $x = bz$, because $(b)_\nu z^{\nu} = (b)_\nu x^{\nu}/b^{\nu} \to x^{\nu}$ in each term of the series. The series (19) exists only on condition that $c \ne -n$. It converges for arbitrary values of $x$. The case $a = -n$ again yields a finite polynomial. Special cases are the Hermite and Laguerre polynomials.

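The asymptotic behaviour for $|x| \to \infty$ is

$${}_1F_1(a; c; x) \to \frac{\Gamma(c)}{\Gamma(c - a)}\,e^{-i\pi a}\,x^{-a} + \frac{\Gamma(c)}{\Gamma(a)}\,e^{x}\,x^{a-c} . \tag{20}$$

For $a = -n$, polynomials of $n$th degree arise, in particular the Laguerre polynomials,

$$L_n^{(m)}(z) = \frac{(n + m)!}{n!\,m!}\,{}_1F_1(-n; m + 1; z) , \tag{21}$$

and the Hermite polynomials,

$$H_{2n}(z) = (-1)^n\frac{(2n)!}{n!}\,{}_1F_1\left(-n; \tfrac{1}{2}; z^2\right) \quad \text{and} \quad H_{2n+1}(z) = (-1)^n\frac{(2n + 1)!}{n!}\,2z\,{}_1F_1\left(-n; \tfrac{3}{2}; z^2\right) . \tag{22}$$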
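
Relation (22) can be verified for low orders by summing the terminating Kummer series (19) directly. The following sketch does this and compares the result with the explicit polynomials $H_2(z) = 4z^2 - 2$, $H_3(z) = 8z^3 - 12z$ and $H_4(z) = 16z^4 - 48z^2 + 12$. The function names and cut-off parameters are illustrative assumptions.

```python
import math


def hyp1f1_series(a, c, x, max_terms=500, tol=1e-16):
    """Sum the confluent hypergeometric series (19) term by term.

    For a = -n (n = 0, 1, 2, ...) a term becomes exactly zero and the
    series terminates as a polynomial of degree n in x.
    """
    term, total = 1.0, 1.0
    for nu in range(max_terms):
        term *= (a + nu) / ((c + nu) * (nu + 1)) * x
        if term == 0.0:
            break
        total += term
        if abs(term) < tol * abs(total):
            break
    return total


def hermite_from_1f1(n, z):
    """Hermite polynomial H_n(z) built from relation (22)."""
    m, odd = divmod(n, 2)
    sign = (-1) ** m
    if odd:
        pref = sign * math.factorial(2 * m + 1) / math.factorial(m)
        return pref * 2.0 * z * hyp1f1_series(-m, 1.5, z * z)
    pref = sign * math.factorial(2 * m) / math.factorial(m)
    return pref * hyp1f1_series(-m, 0.5, z * z)


if __name__ == "__main__":
    z = 0.7
    print(hermite_from_1f1(2, z), 4 * z**2 - 2)
    print(hermite_from_1f1(3, z), 8 * z**3 - 12 * z)
    print(hermite_from_1f1(4, z), 16 * z**4 - 48 * z**2 + 12)
```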


We finally quote a useful integral formula for hypergeometric functions²:

$$\int_0^{\infty} e^{-st}\,t^{d-1}\,{}_AF_B[(a); (b); kt]\,{}_{A'}F_{B'}[(a'); (b'); k't]\,dt = s^{-d}\,\Gamma(d)\sum_{m=0}^{\infty}\frac{((a))_m\,(d)_m\,k^m}{((b))_m\,m!\,s^m}\,{}_{A'+1}F_{B'}[(a'), d + m; (b'); k'/s] \tag{23}$$

² See L.J. Slater: Confluent Hypergeometric Functions (Cambridge University Press, Cambridge 1960) p. 54.

with the following notations:

$$(a)_m = a(a + 1)(a + 2)\cdots(a + m - 1) , \quad ((a))_m = (a_1)_m(a_2)_m\cdots(a_A)_m \quad \text{and}$$

$${}_AF_B[(a); (b); z] = {}_AF_B[a_1, a_2, \ldots, a_A; b_1, b_2, \ldots, b_B; z] = \sum_{k=0}^{\infty}\frac{(a_1)_k(a_2)_k\cdots(a_A)_k}{(b_1)_k(b_2)_k\cdots(b_B)_k}\,\frac{z^k}{k!} . \tag{24}$$

7.1 The Solution of the Oscillator Equation 


Comparing (7.11) with Eq. (17) of the foregoing example, we identify (7.11) as Kummer's differential equation with $c = 1/2$. The general solution is given by (18) of the example:

$$\varphi(y) = A\,{}_1F_1\left(a; \tfrac{1}{2}; y\right) + B\,y^{1/2}\,{}_1F_1\left(a + \tfrac{1}{2}; \tfrac{3}{2}; y\right) , \tag{7.12}$$

where

$$a = -\left(\frac{\kappa}{2} - \frac{1}{4}\right) . \tag{7.13}$$

The solution of our physical problem is determined by the wave function in (7.10). Therefore the necessary square integrability of $\psi$ implies that $\psi$ has to vanish at infinity. Nevertheless, as we see from (20) of the example, both particular solutions, so long as they are not finite polynomials, behave for large values of $y$ as follows:

$$y \to \infty: \quad \varphi(y) \to \text{const}\;e^{y}\,y^{a-1/2} , \quad \text{i.e.} \quad \psi(y) = e^{-y/2}\,\varphi(y) \to \text{const}\;e^{y/2}\,y^{a-1/2} . \tag{7.14}$$

This means that the normalization integral diverges. However, if for the hypergeometric series the condition for break-off (termination) is fulfilled, $\varphi$ becomes a polynomial. Owing to the factor $\exp(-y/2)$ [see (7.14)], $\psi$ will vanish at infinity. Therefore the requirement for normalization leads, in consequence of the condition for break-off (i.e. the hypergeometric functions terminate and become polynomials), to the quantization of energy. Let us now consider the two possible cases.


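1) $a = -n$ and $B = 0$, with $n = 0, 1, 2, \ldots$, with the eigenfunction

$$\psi_n(x) = N_n\,e^{-\lambda x^2/2}\,{}_1F_1\left(-n; \tfrac{1}{2}; \lambda x^2\right) \tag{7.15}$$

and the energy

$$E_n = \hbar\omega\left(2n + \tfrac{1}{2}\right) . \tag{7.16}$$

2) $a + \tfrac{1}{2} = -n$ and $A = 0$, i.e.

$$\frac{\kappa}{2} - \frac{1}{4} = n + \frac{1}{2} ,$$

with the eigenfunction

$$\psi_n(x) = N_n\,e^{-\lambda x^2/2}\,\sqrt{\lambda}\,x\,{}_1F_1\left(-n; \tfrac{3}{2}; \lambda x^2\right) \tag{7.17}$$

and the energy

$$E_n = \hbar\omega\left[(2n + 1) + \tfrac{1}{2}\right] . \tag{7.18}$$

Using (7.9) we find for the energy values:

$$E_n = \left(2n + \tfrac{1}{2}\right)\hbar\omega \quad \text{and} \quad E_n = \left(2n + \tfrac{3}{2}\right)\hbar\omega = \left[(2n + 1) + \tfrac{1}{2}\right]\hbar\omega .$$

Combining these two results, we obtain the discrete energy spectrum:

$$E_n = \left(n + \tfrac{1}{2}\right)\hbar\omega , \quad n = 0, 1, 2, \ldots . \tag{7.19}$$

As we see, the energy spectrum of the harmonic oscillator is equidistant with the spacing $\hbar\omega$ and has a finite value in the ground state ($n = 0$), the zero-point energy $\tfrac{1}{2}\hbar\omega$ (see Fig. 7.1).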
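
The quantization condition (7.19) can be cross-checked without hypergeometric functions by a simple shooting calculation. The sketch below is not the method of the text: it assumes units with ħ = m = ω = 1, so that (7.4) reads ψ'' = (x² − 2ε)ψ with ε = E/ħω, integrates from x = 0 with a fourth-order Runge-Kutta step, and uses bisection to find the energies at which ψ stays bounded at a large cut-off `x_max`. The cut-off, the step number and the brackets [n, n+1] are assumptions made for the illustration.

```python
def psi_at_xmax(eps, parity, x_max=6.0, steps=1500):
    """Integrate psi'' = (x^2 - 2*eps) psi from 0 to x_max with RK4.

    parity 0: even solution (psi(0) = 1, psi'(0) = 0);
    parity 1: odd solution  (psi(0) = 0, psi'(0) = 1).
    """
    h = x_max / steps
    psi, dpsi = (1.0, 0.0) if parity == 0 else (0.0, 1.0)
    x = 0.0

    def acc(xx, pp):
        return (xx * xx - 2.0 * eps) * pp

    for _ in range(steps):
        k1p, k1d = dpsi, acc(x, psi)
        k2p, k2d = dpsi + 0.5 * h * k1d, acc(x + 0.5 * h, psi + 0.5 * h * k1p)
        k3p, k3d = dpsi + 0.5 * h * k2d, acc(x + 0.5 * h, psi + 0.5 * h * k2p)
        k4p, k4d = dpsi + h * k3d, acc(x + h, psi + h * k3p)
        psi += h * (k1p + 2.0 * k2p + 2.0 * k3p + k4p) / 6.0
        dpsi += h * (k1d + 2.0 * k2d + 2.0 * k3d + k4d) / 6.0
        x += h
    return psi


def eigenvalue(lo, hi, parity, iterations=30):
    """Bisection on the sign of psi(x_max); assumes one eigenvalue in [lo, hi]."""
    f_lo = psi_at_xmax(lo, parity)
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        f_mid = psi_at_xmax(mid, parity)
        if f_lo * f_mid <= 0.0:
            hi = mid
        else:
            lo, f_lo = mid, f_mid
    return 0.5 * (lo + hi)


# Even n belongs to even parity, odd n to odd parity.
levels = [eigenvalue(n, n + 1, n % 2) for n in range(4)]

if __name__ == "__main__":
    print(levels)
```

Within the accuracy of the integration the numbers in `levels` should reproduce the equidistant values n + 1/2 of (7.19).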

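The polynomials occurring in (7.15) and (7.17) are known as Hermite polynomials. With the usual normalization factor, they are defined as

$$H_{2n}(\xi) = (-1)^n\frac{(2n)!}{n!}\,{}_1F_1\left(-n; \tfrac{1}{2}; \xi^2\right) , \quad H_{2n+1}(\xi) = (-1)^n\frac{2(2n + 1)!}{n!}\,\xi\,{}_1F_1\left(-n; \tfrac{3}{2}; \xi^2\right) . \tag{7.20}$$

The eigenfunctions and energies (7.15–18) can then be written as

a) $\psi_n = N_n\,e^{-\lambda x^2/2}\,H_{2n}(\sqrt{\lambda}\,x)$, $\quad E_n = \left(2n + \tfrac{1}{2}\right)\hbar\omega$;

b) $\psi_n = N_n\,e^{-\lambda x^2/2}\,H_{2n+1}(\sqrt{\lambda}\,x)$, $\quad E_n = \left[(2n + 1) + \tfrac{1}{2}\right]\hbar\omega$, $\quad n = 0, 1, 2, \ldots$;

and can finally be collected uniformly as

$$\psi_n = N_n\,e^{-\lambda x^2/2}\,H_n(\sqrt{\lambda}\,x) , \quad E_n = \left(n + \tfrac{1}{2}\right)\hbar\omega , \quad n = 0, 1, 2, \ldots . \tag{7.21}$$

For the Hermite polynomials the useful relation exists:

$$H_n(\xi) = (-1)^n\,e^{\xi^2}\,\frac{d^n}{d\xi^n}\,e^{-\xi^2} , \tag{7.22}$$

which we will prove in the following example.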




7.2 Mathematical Supplement: Hermite Polynomials 


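On the basis of the foregoing considerations, the functions $\exp[-(\lambda/2)x^2]\,H_n(\sqrt{\lambda}\,x)$, i.e. the Hermite polynomials multiplied by $\exp[-(\lambda/2)x^2]$, obviously fulfil the differential equation (7.6) if

$$k^2 = \frac{2m}{\hbar^2}E = \frac{2m}{\hbar^2}\,\hbar\omega\left(n + \tfrac{1}{2}\right) = \frac{2m\omega}{\hbar}\left(n + \tfrac{1}{2}\right) = 2\lambda\left(n + \tfrac{1}{2}\right) .$$

Therefore substitution of $\exp[-(\lambda/2)x^2]\,H_n(\sqrt{\lambda}\,x)$ into (7.6) and straightforward calculation yield

$$\frac{d}{dx}\left\{\exp[-(\lambda/2)x^2]\,H_n(\sqrt{\lambda}\,x)\right\} = -\lambda x\exp[-(\lambda/2)x^2]\,H_n(\sqrt{\lambda}\,x) + \exp[-(\lambda/2)x^2]\,\frac{dH_n(\sqrt{\lambda}\,x)}{dx} ,$$

$$\frac{d^2}{dx^2}\left\{\exp[-(\lambda/2)x^2]\,H_n(\sqrt{\lambda}\,x)\right\} = (\lambda x)^2\exp[-(\lambda/2)x^2]\,H_n(\sqrt{\lambda}\,x) + \exp[-(\lambda/2)x^2]\,\frac{d^2H_n(\sqrt{\lambda}\,x)}{dx^2} - \lambda\exp[-(\lambda/2)x^2]\,H_n(\sqrt{\lambda}\,x) - 2\lambda x\exp[-(\lambda/2)x^2]\,\frac{dH_n(\sqrt{\lambda}\,x)}{dx} . \tag{1}$$

Thus

$$(\lambda^2x^2 - \lambda)\,H_n(\sqrt{\lambda}\,x) - 2\lambda x\,\frac{dH_n(\sqrt{\lambda}\,x)}{dx} + \frac{d^2H_n(\sqrt{\lambda}\,x)}{dx^2} + \left[2\lambda\left(n + \tfrac{1}{2}\right) - \lambda^2x^2\right]H_n(\sqrt{\lambda}\,x) = 0 , \tag{2}$$

or

$$\frac{d^2H_n(\sqrt{\lambda}\,x)}{dx^2} - 2\lambda x\,\frac{dH_n(\sqrt{\lambda}\,x)}{dx} + 2\lambda n\,H_n(\sqrt{\lambda}\,x) = 0 .$$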

Introducing the variable $\xi = \sqrt{\lambda}\,x$ we obtain after division by $\lambda$

$$\frac{d^2H_n(\xi)}{d\xi^2} - 2\xi\,\frac{dH_n(\xi)}{d\xi} + 2n\,H_n(\xi) = 0 , \quad n = 0, 1, 2, \ldots . \tag{3}$$

This differential equation is the defining differential equation for the Hermite polynomials if $n$ is a non-negative integer. From (3) a significantly more elegant and manageable formulation of the Hermite polynomials can be given, using the generating function $S(\xi, s)$, so that

$$S(\xi, s) = e^{\xi^2 - (s - \xi)^2} = e^{-s^2 + 2s\xi} = \sum_{n=0}^{\infty}\frac{H_n(\xi)}{n!}\,s^n . \tag{4}$$

Expanding the exponential function in terms of powers of $s$ and $\xi$, we see that the coefficients of the powers $s^n$ are polynomials in terms of $\xi$, namely the Hermite polynomials. This can be shown as follows: we have


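$$\frac{\partial S}{\partial\xi} = 2s\,e^{-s^2 + 2s\xi} = \sum_{n=0}^{\infty}\frac{2s^{n+1}}{n!}\,H_n(\xi) = \sum_{n=0}^{\infty}\frac{s^n}{n!}\,\frac{\partial H_n(\xi)}{\partial\xi} ,$$

$$\frac{\partial S}{\partial s} = (-2s + 2\xi)\,e^{-s^2 + 2s\xi} = \sum_{n=0}^{\infty}\frac{(-2s + 2\xi)\,s^n}{n!}\,H_n(\xi) = \sum_{n=1}^{\infty}\frac{s^{n-1}}{(n - 1)!}\,H_n(\xi) . \tag{5}$$

Equating equal powers of $s$ in the sums of these two equations, we obtain

$$\frac{\partial H_n(\xi)}{\partial\xi} = 2n\,H_{n-1}(\xi) , \quad H_{n+1}(\xi) = 2\xi\,H_n(\xi) - 2n\,H_{n-1}(\xi) . \tag{6}$$

Therefore it follows that

$$\frac{\partial H_n(\xi)}{\partial\xi} = 2\xi\,H_n(\xi) - H_{n+1}(\xi) , \tag{7}$$

and hence

$$\frac{\partial^2H_n(\xi)}{\partial\xi^2} = 2H_n(\xi) + 2\xi\,\frac{\partial H_n(\xi)}{\partial\xi} - \frac{\partial H_{n+1}(\xi)}{\partial\xi} = 2\xi\,\frac{\partial H_n(\xi)}{\partial\xi} + 2H_n(\xi) - (2n + 2)H_n(\xi) = 2\xi\,\frac{\partial H_n(\xi)}{\partial\xi} - 2n\,H_n(\xi) . \tag{8}$$

This is exactly differential equation (3), proving that the $H_n(\xi)$ appearing in the generating function (4) are indeed Hermite polynomials.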
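The recurrence formulas (6) may be used to calculate the $H_n$ and their derivatives. Another explicit expression directly obtainable from the generating function is quite useful; let us now establish this important relation. From (4) it follows instantly that

$$\frac{\partial^nS(\xi, s)}{\partial s^n}\bigg|_{s=0} = H_n(\xi) . \tag{9}$$

Furthermore, for any function $f(s - \xi)$,

$$\frac{\partial f}{\partial s} = -\frac{\partial f}{\partial\xi} . \tag{10}$$

Thus

$$\frac{\partial^nS}{\partial s^n} = e^{\xi^2}\,\frac{\partial^n e^{-(s - \xi)^2}}{\partial s^n} = (-1)^n\,e^{\xi^2}\,\frac{\partial^n e^{-(s - \xi)^2}}{\partial\xi^n} . \tag{11}$$

Comparing (11) with (9) yields the very useful formula,

$$H_n(\xi) = (-1)^n\,e^{\xi^2}\,\frac{d^n}{d\xi^n}\,e^{-\xi^2} . \tag{12}$$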

The $H_n(\xi)$ are polynomials of $n$th degree in $\xi$ with the dominant term $2^n\xi^n$. The first five $H_n(\xi)$ calculated from (7.22) or (12) of the foregoing example are:

$$H_0(\xi) = 1 , \quad H_1(\xi) = 2\xi , \quad H_2(\xi) = 4\xi^2 - 2 , \quad H_3(\xi) = 8\xi^3 - 12\xi , \quad H_4(\xi) = 16\xi^4 - 48\xi^2 + 12 . \tag{7.23}$$
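
The list (7.23) can be reproduced mechanically from the recurrence $H_{n+1}(\xi) = 2\xi H_n(\xi) - 2nH_{n-1}(\xi)$ of Example 7.2, Eq. (6). The following sketch stores each polynomial as a list of integer coefficients, lowest power first; the function name and this representation are illustrative choices.

```python
def hermite_coeffs(n):
    """Coefficients of H_n(xi), lowest power first.

    Built from H_0 = 1, H_1 = 2*xi and the recurrence
    H_{k+1} = 2*xi*H_k - 2*k*H_{k-1}.
    """
    h_prev, h = [1], [0, 2]
    if n == 0:
        return h_prev
    for k in range(1, n):
        nxt = [0] + [2 * c for c in h]      # 2*xi*H_k shifts all powers by one
        for i, c in enumerate(h_prev):
            nxt[i] -= 2 * k * c             # subtract 2*k*H_{k-1}
        h_prev, h = h, nxt
    return h


if __name__ == "__main__":
    for n in range(5):
        print(n, hermite_coeffs(n))
```

For example, `hermite_coeffs(4)` corresponds to $12 - 48\xi^2 + 16\xi^4$, and the last entry of each list is the leading coefficient $2^n$.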


The eigenfunctions (7.21) were combined by introducing the abbreviation $\xi = \sqrt{\lambda}\,x$ and using the Hermite polynomials in a way that holds for both even and odd $n$, i.e.

$$\psi_n(x) = N_n\,e^{-\xi^2/2}\,H_n(\xi) , \quad \xi = \sqrt{\lambda}\,x . \tag{7.24}$$

The constant $N_n$, which depends on the index $n$, is determined by the normalization condition

$$\int_{-\infty}^{\infty}|\psi_n(x)|^2\,dx = 1 , \tag{7.25}$$

since we require the position probability to be 1 for the particle in the entire configuration space. Thus

$$\int_{-\infty}^{\infty}|\psi_n(x)|^2\,dx = \frac{N_n^2}{\sqrt{\lambda}}\int_{-\infty}^{\infty}e^{-\xi^2}\,H_n^2(\xi)\,d\xi = 1 . \tag{7.26}$$

Using relation (12) of Example 7.2 to express one of the Hermite polynomials that appears in the integrand of the normalization integral, the evaluation of this integral becomes simply

$$\int_{-\infty}^{\infty}|\psi_n(x)|^2\,dx = (-1)^n\,\frac{N_n^2}{\sqrt{\lambda}}\int_{-\infty}^{\infty}H_n(\xi)\,\frac{d^n e^{-\xi^2}}{d\xi^n}\,d\xi . \tag{7.27}$$

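By partial integration we obtain

$$\int_{-\infty}^{\infty}H_n(\xi)\,\frac{d^n e^{-\xi^2}}{d\xi^n}\,d\xi = \left[H_n(\xi)\,\frac{d^{n-1}e^{-\xi^2}}{d\xi^{n-1}}\right]_{-\infty}^{\infty} - \int_{-\infty}^{\infty}\frac{dH_n}{d\xi}\,\frac{d^{n-1}e^{-\xi^2}}{d\xi^{n-1}}\,d\xi . \tag{7.28}$$

The first term is, because of (12) in Example 7.2, equal to $(-1)^{n-1}e^{-\xi^2}H_{n-1}(\xi)H_n(\xi)$. It vanishes at infinity, due to the exponential function. Having carried out partial integration $n$ times, we are left with

$$\int_{-\infty}^{\infty}H_n(\xi)\,\frac{d^n e^{-\xi^2}}{d\xi^n}\,d\xi = (-1)^n\int_{-\infty}^{\infty}\frac{d^nH_n}{d\xi^n}\,e^{-\xi^2}\,d\xi . \tag{7.29}$$

Since $H_n(\xi)$ is a polynomial of $n$th order with the dominant term $2^n\xi^n$, for the $n$th derivative,

$$\frac{d^n}{d\xi^n}H_n(\xi) = 2^n\,n! \tag{7.30}$$

holds. From this we find that

$$\int_{-\infty}^{\infty}H_n(\xi)\,\frac{d^n e^{-\xi^2}}{d\xi^n}\,d\xi = (-1)^n\,2^n\,n!\int_{-\infty}^{\infty}e^{-\xi^2}\,d\xi = (-1)^n\,2^n\,n!\,\sqrt{\pi}$$

and for the normalization constant,

$$N_n = \sqrt{\sqrt{\frac{\lambda}{\pi}}\,\frac{1}{2^n\,n!}} . \tag{7.31}$$

The stationary states of the harmonic oscillator in quantum mechanics are therefore

$$\psi_n(x) = \sqrt{\frac{1}{2^n\,n!}\sqrt{\frac{\lambda}{\pi}}}\,\exp\left(-\tfrac{1}{2}\lambda x^2\right)H_n(\sqrt{\lambda}\,x) . \tag{7.32}$$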
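
The normalization constant (7.31) can be confirmed by numerical quadrature. The sketch below evaluates the states (7.32) with Hermite polynomials obtained from the recurrence of Example 7.2, Eq. (6), and integrates products of two states with a composite Simpson rule. The integration range, the number of steps and the value of `lam` (standing for λ = mω/ħ) are arbitrary assumptions; the vanishing overlap of different states is shown only as an additional check.

```python
import math


def hermite(n, xi):
    """H_n(xi) from H_{k+1} = 2*xi*H_k - 2*k*H_{k-1}."""
    h_prev, h = 1.0, 2.0 * xi
    if n == 0:
        return h_prev
    for k in range(1, n):
        h_prev, h = h, 2.0 * xi * h - 2.0 * k * h_prev
    return h


def psi(n, x, lam=1.0):
    """Stationary state (7.32) with the normalization constant (7.31)."""
    norm = math.sqrt(math.sqrt(lam / math.pi) / (2**n * math.factorial(n)))
    return norm * math.exp(-0.5 * lam * x * x) * hermite(n, math.sqrt(lam) * x)


def overlap(m, n, lam=1.0, x_max=10.0, steps=4000):
    """Composite Simpson estimate of the integral of psi_m * psi_n."""
    h = 2.0 * x_max / steps
    total = 0.0
    for i in range(steps + 1):
        x = -x_max + i * h
        weight = 1 if i in (0, steps) else (4 if i % 2 else 2)
        total += weight * psi(m, x, lam) * psi(n, x, lam)
    return total * h / 3.0


if __name__ == "__main__":
    for n in range(4):
        print(n, overlap(n, n), overlap(n, n + 2))
```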


Here we have suppressed the phase factor $(-1)^n$, since it is not essential. To discuss the solution, we take a look at the first three eigenfunctions of the linear harmonic oscillator (see Fig. 7.1):

$$n = 0: \quad \psi_0(x) = \sqrt{\sqrt{\frac{\lambda}{\pi}}}\,\exp\left(-\tfrac{1}{2}\lambda x^2\right) ,$$

$$n = 1: \quad \psi_1(x) = \sqrt{\frac{1}{2}\sqrt{\frac{\lambda}{\pi}}}\,\exp\left(-\tfrac{1}{2}\lambda x^2\right)2\sqrt{\lambda}\,x ,$$

$$n = 2: \quad \psi_2(x) = \sqrt{\frac{1}{8}\sqrt{\frac{\lambda}{\pi}}}\,\exp\left(-\tfrac{1}{2}\lambda x^2\right)\left(4\lambda x^2 - 2\right) . \tag{7.33}$$

From (7.24) and (7.30) it follows that, for space reflection, the eigenfunctions have the symmetry property

$$\psi_n(-x) = (-1)^n\,\psi_n(x) . \tag{7.34}$$

This means

$n$ even: $\psi(-x) = \psi(x)$, parity $+1$;

$n$ odd: $\psi(-x) = -\psi(x)$, parity $-1$.

Fig. 7.1. Oscillator potential, energy levels and corresponding wavefunctions

For the lowest $H_n$, it can easily be shown that they possess precisely $n$ different real zeros, and that the functions $e^{-\xi^2}H_n$ possess $n + 1$ extremal values (see Fig. 7.1). With respect to (12) in Example 7.2, we have

$$H_{n+1} = -e^{\xi^2}\,\frac{d}{d\xi}\left(e^{-\xi^2}H_n\right) . \tag{7.35}$$

On the assumption that $H_n$ possesses $n$ different real zeros, we can conclude the existence of $n + 1$ extremal values for $e^{-\xi^2}H_n$ (since $e^{-\xi^2}H_n \to 0$ for $\xi \to \pm\infty$, an extremum lies between neighbouring zeros and beyond the outermost ones). The extremal values are identical with the zeros of the derivative $d/d\xi$; therefore $H_{n+1}$ has precisely $n + 1$ real zeros. This conclusion shows that the Hermite polynomials $H_n(\xi)$, and in consequence the wave functions $\psi_n(\xi)$, possess $n$ different real zeros. This is a special case of a universally valid theorem which states that the principal quantum number of an eigenfunction is identical with the number of zeros.

In Fig. 7.1, some of the $\psi_n$ are plotted together with an energy diagram. The energy eigenvalues are represented as horizontal lines at the quantized values $E_n = (n + \tfrac{1}{2})\hbar\omega$. For each of the lines there is a corresponding eigenfunction $\psi_n(x)$ drawn on an arbitrary scale.

In addition, the figure contains the function of the potential energy

$$V(x) = \tfrac{1}{2}m\omega^2x^2 . \tag{7.36}$$

Thus we can make a comparison with the classical harmonic oscillator, which oscillates with a certain amplitude characterized by the vanishing kinetic energy at the turning point. Since $E = T + V$, the region of classically possible oscillations is bounded by the point of intersection of the parabola $V(x)$ and the straight line of total energy $E$. As a matter of fact, the figure shows that the extreme values of the function $\psi$ are localized within the classical region; nevertheless, tails of the wave function extend to infinity.

The deviating behaviour becomes even more significant if we contemplate the position probability of the particle. Let $T$ represent the period of oscillation of the particle, then classically we have

$$w_{\rm cl}(x)\,dx = \frac{2\,dt}{T} = \frac{\omega}{\pi}\,\frac{dx}{dx/dt} . \tag{7.37}$$

The particle performs harmonic oscillations:

$$x = a\sin\omega t , \quad \frac{dx}{dt} = a\omega\cos\omega t = a\omega\sqrt{1 - (x/a)^2} ; \tag{7.38}$$

hence,

$$w_{\rm cl}(x)\,dx = \frac{dx}{\pi a\sqrt{1 - (x/a)^2}} . \tag{7.39}$$

The amplitude $a$ is obtained from the energy $E = \tfrac{1}{2}m\omega^2a^2$, i.e. $a = \sqrt{2E/m\omega^2}$. Contrary to this, the quantum-mechanical probability for localizing a particle within an interval between $x$ and $x + dx$ is given by (see Fig. 7.2):

$$w_{\rm qu}(x)\,dx = |\psi(x)|^2\,dx , \tag{7.40}$$

which means, e.g. for $n = 1$ with respect to (7.33):

$$w_{\rm qu}(x)\,dx = |\psi_1(x)|^2\,dx = 2\sqrt{\frac{\lambda}{\pi}}\,\lambda x^2\,e^{-\lambda x^2}\,dx . \tag{7.41}$$

It can easily be shown that $w_{\rm qu}(x)$ has a minimum at $x = 0$ and a maximum at

$$x_{\rm max,qu} = \pm\frac{1}{\sqrt{\lambda}} = \pm\sqrt{\frac{\hbar}{m\omega}} , \tag{7.42}$$

whereas classically, with $E = \tfrac{3}{2}\hbar\omega$, it holds that

$$x_{\rm max,cl} = \pm a = \pm\sqrt{\frac{2E}{m\omega^2}} = \pm\sqrt{\frac{3\hbar}{m\omega}} . \tag{7.43}$$

The agreement between classical and quantum probabilities improves rapidly with increasing quantum number $n$. A plot for $n = 15$ is given in Fig. 7.3; for large quantum numbers (here $n = 15$), the mean value of the quantum distribution approximates the classical limit.


Fig. 7.2. Comparison of the probability density for finding a particle moving in an oscillator, classically and quantum mechanically. The dashed lines denote the classical turning points

Fig. 7.3. Quantum-mechanical and classical probability densities for a particle in a harmonic oscillator potential with the energy $E = (15 + 1/2)\hbar\omega$, i.e. in the $n = 15$ state. The dashed vertical lines indicate the classical turning points

From the figures we perceive that beyond the region which is classically limited by the relation $E = T + V$, the probability density is not equal to zero. This is a consequence of the fact that $T$ and $V$ are noncommuting quantities, i.e. they do not have simultaneously exact values, since $V(x)$ is a function of space, whereas $T = p^2/2m$ is a function of momentum. Therefore, owing to the uncertainty relation $[\hat{p}, \hat{x}]_- = -i\hbar$, it is impossible to split the energy precisely in $E = T + V$. It would seem that localizing the particle beyond the classically permissible region implies a violation of energy conservation; however, this is not the case. If we try to localize the particle (i.e. concentrate its wave function) in the small tails of the function $\psi$, the uncertainty of momentum increases to a point where the new total energy exceeds the value of the potential energy $V(x)$. Thus, from the point of view of energy, the particle is allowed to adopt an $x$-value beyond the classically permitted region. In any case, it is the wave character of the quantum-mechanical wave function which allows the penetration into potential barriers and, finally, the tunnelling through them. This effect is analogous to the jumping of electromagnetic waves (light) over narrow slits.*

The behaviour described above is responsible for the tunnel effect, according
to which a potential barrier of height $V_0$ can even be surmounted by particles with
energy $E < V_0$. The tunnel effect appears, for example, in the case of field
emission and $\alpha$ decay. It has recently received particular attention because of its
practical application in the so-called tunnelling-electron microscope.* A further
difference between the classical and quantum-mechanical oscillator is the state
of minimum energy. Classically a particle can be in the state of equilibrium at
$x = 0$, $p = 0$, $E = 0$. In quantum mechanics the smallest possible energy value is
$E = \hbar\omega/2$, the zero-point energy.

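This zero-point energy is a direct consequence of the uncertainty relation

$$\Delta x\,\Delta p \ge \frac{\hbar}{2}\ . \qquad (7.44)$$

Let us take a closer look at the expressions

$$\overline{\Delta p^2} = \overline{(p-\bar p)^2} = \overline{p^2} - 2\bar p\,\bar p + \bar p^2 = \overline{p^2} - \bar p^2\ . \qquad (7.45)$$

Analogously, $\overline{\Delta x^2} = \overline{x^2} - \bar x^2$. On the other hand, in a state with a fixed energy
value, the mean values $\bar p$ and $\bar x$ are equal to zero since the integrand is an odd
function:

$$\bar x = \int\psi_n^*(x)\,x\,\psi_n(x)\,dx = \int|\psi_n(x)|^2x\,dx = 0 \qquad (7.46)$$

and

$$\bar p = \int\psi_n^*(x)\,\hat p\,\psi_n(x)\,dx = -i\hbar\int\psi_n^*(x)\,\psi_n'(x)\,dx
= -\frac{i\hbar}{2}\int\frac{d}{dx}|\psi_n(x)|^2dx = -\frac{i\hbar}{2}\Big[|\psi_n(x)|^2\Big]_{-\infty}^{\infty} = 0\ . \qquad (7.47)$$

Therefore

$$\overline{\Delta x^2} = \overline{x^2}\ , \quad \overline{\Delta p^2} = \overline{p^2}\ . \qquad (7.48)$$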


* See J.D. Jackson: Classical Electrodynamics, 2nd ed. (Wiley, New York 1980) and 
W. Greiner: Theoretische Physik, Bd. 3, 4. Aufl. (Harri Deutsch, Thun and Frankfurt a.M. 
1986). 

* In 1986 G. Binnig and H. Rohrer received the Nobel Prize in physics for developing the 
tunnelling-electron microscope; see e.g. G. Binnig and H. Rohrer, Scientific American, 
Aug. 1985, p. 40. 


With this we may write the uncertainty relation in the form

$$\overline{p^2}\;\overline{x^2} \ge \frac{\hbar^2}{4}\ . \qquad (7.49)$$

Now the average energy of the oscillator is

$$\bar H = \frac{\overline{p^2}}{2m} + \frac{m\omega^2}{2}\,\overline{x^2}\ . \qquad (7.50)$$


Comparing both equations, we see that an increasing potential energy leads to
a decrease of kinetic energy and vice versa.
Combining these equations, we obtain

$$\bar H \ge \frac{\overline{p^2}}{2m} + \frac{m\omega^2\hbar^2}{8\,\overline{p^2}}\ . \qquad (7.51)$$

The function $\bar H = \bar H(\overline{p^2})$ possesses a minimum at $\overline{p^2} = \frac12 m\omega\hbar$, which can easily
be confirmed by evaluating the first and second derivative.

Since a state of fixed energy is characterized by $\bar H = E$, we obtain as
minimum value of the possible energy eigenvalues

$$\min E \ge \frac{\hbar\omega}{2} = E_0\ .$$


The zero-point energy Eo therefore is the smallest energy value that is compatible 
with the uncertainty relation. 
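The minimization mentioned above can be written out explicitly. The sketch below assumes that the bound (7.51) reads $\bar H \ge \overline{p^2}/2m + m\omega^2\hbar^2/(8\,\overline{p^2})$, as obtained by inserting $\overline{x^2} \ge \hbar^2/(4\,\overline{p^2})$ into the mean energy; the shorthand $P$ is introduced here only for convenience.

```latex
\begin{align*}
\bar H(P) &= \frac{P}{2m} + \frac{m\omega^2\hbar^2}{8P}\,, \qquad P \equiv \overline{p^2}\,,\\
\frac{d\bar H}{dP} &= \frac{1}{2m} - \frac{m\omega^2\hbar^2}{8P^2} = 0
  \;\Longrightarrow\; P_{\min} = \tfrac12\,m\omega\hbar\,,\\
\frac{d^2\bar H}{dP^2} &= \frac{m\omega^2\hbar^2}{4P^3} > 0\,,\\
\bar H(P_{\min}) &= \frac{\hbar\omega}{4} + \frac{\hbar\omega}{4} = \frac{\hbar\omega}{2}\,.
\end{align*}
```

At the minimum the kinetic and potential contributions are equal, each amounting to $\hbar\omega/4$.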

It is possible to demonstrate the zero-point motion (which leads to the zero-point
energy) by observing the scattering of light in crystals. In solid bodies, atoms
and molecules perform small oscillations; according to classical theory, they
ought to diminish with decreasing temperature. These atomic oscillations are
the cause of the light scattering, which therefore should vanish with decreasing
temperature. However, experiments demonstrate that the intensity of the scattered
light converges toward a finite limit, showing that even at absolute
zero, atomic oscillations occur.


7.2 The Description of the Harmonic Oscillator 
by Creation and Annihilation Operators 


The normalized eigenfunctions of the harmonic oscillator all take the form 


$$\psi_n(\xi) = \sqrt[4]{\frac{\lambda}{\pi}}\,\frac{1}{\sqrt{2^n n!}}\,e^{-\xi^2/2}H_n(\xi)\ . \qquad (7.52)$$

For the Hermite polynomials $H_n(\xi)$, the following recurrence relations hold
(see Example 7.2):

$$\xi H_n = nH_{n-1} + \tfrac12H_{n+1}\ , \quad \frac{dH_n}{d\xi} = 2nH_{n-1}\ . \qquad (7.53)$$


From these formulae, connections between eigenfunctions of the harmonic 
oscillator, which belong to neighbouring quantum numbers, are obtainable: 


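$$\xi\psi_n = \sqrt{\frac n2}\,\psi_{n-1} + \sqrt{\frac{n+1}{2}}\,\psi_{n+1}\ , \qquad (7.54)$$

$$\frac{\partial\psi_n}{\partial\xi} = 2\sqrt{\frac n2}\,\psi_{n-1} - \xi\psi_n\ . \qquad (7.55)$$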


Equation (7.55) is now rearranged, using (7.54), so that the right-hand sides of 
both equations resemble each other: 


$$\frac{\partial\psi_n}{\partial\xi} = \sqrt{\frac n2}\,\psi_{n-1} - \sqrt{\frac{n+1}{2}}\,\psi_{n+1}\ . \qquad (7.56)$$


By addition (or subtraction) of (7.54) and (7.56), we get the relations 


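$$\frac{1}{\sqrt2}\Big(\xi + \frac{\partial}{\partial\xi}\Big)\psi_n = \sqrt n\,\psi_{n-1}\ , \quad
\frac{1}{\sqrt2}\Big(\xi - \frac{\partial}{\partial\xi}\Big)\psi_n = \sqrt{n+1}\,\psi_{n+1}\ . \qquad (7.57)$$

With these equations we can now evaluate the neighbouring eigenfunctions
$\psi_{n-1}$ and $\psi_{n+1}$ from the eigenfunction $\psi_n$. For the sake of brevity, we define the
operators

$$\frac{1}{\sqrt2}\Big(\xi + \frac{\partial}{\partial\xi}\Big) = \hat a\ , \quad
\frac{1}{\sqrt2}\Big(\xi - \frac{\partial}{\partial\xi}\Big) = \hat a^+\ , \qquad (7.58)$$

and hence obtain, instead of (7.57),

$$\hat a\,\psi_n = \sqrt n\,\psi_{n-1}\ , \quad \hat a^+\psi_n = \sqrt{n+1}\,\psi_{n+1}\ . \qquad (7.59)$$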


For the present we will call $\hat a$ the lowering operator and $\hat a^+$ the raising operator
since the index $n$ of the state $\psi_n$ is lowered or raised, respectively. In the following
we formulate a better interpretation of $\hat a$ and $\hat a^+$ and then also name the
operators more appropriately.
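The action of the two operators can be checked numerically. The sketch below assumes the definitions $\hat a = (\xi + \partial/\partial\xi)/\sqrt2$ and $\hat a^+ = (\xi - \partial/\partial\xi)/\sqrt2$ and uses eigenfunctions normalized in the variable $\xi$ (the constant factor drops out of the relations); the derivative is approximated by a central difference, and the function names are illustrative.

```python
import math

def hermite(n, xi):
    """Physicists' Hermite polynomial H_n(xi) via the three-term recurrence."""
    h_prev, h = 1.0, 2.0 * xi
    if n == 0:
        return h_prev
    for k in range(1, n):
        h_prev, h = h, 2.0 * xi * h - 2.0 * k * h_prev
    return h

def psi(n, xi):
    """Oscillator eigenfunction in xi, normalized so that the xi-integral of psi^2 is 1."""
    norm = 1.0 / math.sqrt(math.sqrt(math.pi) * 2.0 ** n * math.factorial(n))
    return norm * hermite(n, xi) * math.exp(-0.5 * xi * xi)

def lower_op(f, xi, h=1e-5):
    """(xi + d/dxi)/sqrt(2) applied to f at the point xi."""
    dfdxi = (f(xi + h) - f(xi - h)) / (2.0 * h)
    return (xi * f(xi) + dfdxi) / math.sqrt(2.0)

def raise_op(f, xi, h=1e-5):
    """(xi - d/dxi)/sqrt(2) applied to f at the point xi."""
    dfdxi = (f(xi + h) - f(xi - h)) / (2.0 * h)
    return (xi * f(xi) - dfdxi) / math.sqrt(2.0)

n = 3
points = [-1.5, -0.3, 0.0, 0.7, 2.0]
state = lambda t: psi(n, t)
err_lower = max(abs(lower_op(state, xi) - math.sqrt(n) * psi(n - 1, xi)) for xi in points)
err_raise = max(abs(raise_op(state, xi) - math.sqrt(n + 1) * psi(n + 1, xi)) for xi in points)
print(err_lower, err_raise)  # both limited only by the finite-difference error
```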


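7.3 Properties of the Operators â and â⁺


The operators $\hat a$ and $\hat a^+$ are adjoint to each other (i.e. not self-adjoint), since it
holds that (with partial integration)

$$\int\psi^*\,\frac{1}{\sqrt2}\Big(\xi + \frac{\partial}{\partial\xi}\Big)\varphi\,d\xi
= \int\Big[\frac{1}{\sqrt2}\Big(\xi - \frac{\partial}{\partial\xi}\Big)\psi\Big]^*\varphi\,d\xi\ , \qquad (7.60)$$

or, abbreviated,

$$\langle\psi|\hat a\varphi\rangle = \langle\hat a^+\psi|\varphi\rangle\ . \qquad (7.61)$$

We have made use of the fact that the operators are real from their definition
(7.58), i.e. $\hat a^* = \hat a$ and $(\hat a^+)^* = \hat a^+$.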

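The wave function $\psi_n$ is an eigenfunction of the operator product $\hat a^+\hat a$
because

$$\hat a^+\hat a\,\psi_n = \sqrt n\,\hat a^+\psi_{n-1} = n\,\psi_n\ , \qquad (7.62)$$

which can be checked with (7.59). The eigenvalue $n$ is the index of the oscillator
wave function $\psi_n$. We therefore define a number operator $\hat N$

$$\hat N = \hat a^+\hat a\ , \quad \hat N\psi_n = n\,\psi_n\ . \qquad (7.63)$$

The eigenvalues of $\hat N$ are $n$; the eigenfunctions are $\psi_n$. We get the commutator

$$[\hat a, \hat a^+]_- = \hat a\hat a^+ - \hat a^+\hat a = 1 \qquad (7.64)$$

easily by evaluating the two products according to (7.58).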
By successively applying $\hat a^+$ on $\psi_0$ we are able to calculate all eigenfunctions,
starting from the ground state. From (7.59) it follows that

$$\psi_n = \frac{1}{\sqrt n}\,\hat a^+\psi_{n-1} = \frac{1}{\sqrt{n!}}\,(\hat a^+)^n\psi_0\ . \qquad (7.65)$$

Up to this point we have developed a formalism in terms of $\hat a$ and $\hat a^+$ that
enables us to set up a differential equation for the ground state. With $n = 0$, we
find from (7.58) and (7.59):

$$\hat a\,\psi_0 = 0 \quad\text{and}\quad \xi\psi_0 + \frac{\partial\psi_0}{\partial\xi} = 0\ . \qquad (7.66)$$

The substitution $\psi_0 \sim e^{\alpha\xi^2}$ yields $\alpha = -\frac12$. Thus the function of the ground
state is, up to a normalization factor,

$$\psi_0 \sim e^{-\xi^2/2}\ ,$$

which coincides with the solution of the Schrödinger equation for the harmonic
oscillator, yielding for the normalized ground state (7.33)

$$\psi_0 = \sqrt[4]{\frac{\lambda}{\pi}}\,e^{-\xi^2/2}\ . \qquad (7.67)$$
7 


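7.4 Representation of the Oscillator Hamiltonian
in Terms of â and â⁺


For the one-dimensional harmonic oscillator, the Hamiltonian is given by

$$\hat H = -\frac{\hbar^2}{2m}\frac{\partial^2}{\partial x^2} + \frac m2\,\omega^2x^2\ . \qquad (7.68)$$

Corresponding to the introduction of the new variable $\xi = \sqrt\lambda\,x = \sqrt{m\omega/\hbar}\,x$,
we can now define a new momentum operator

$$\hat p_\xi = -i\frac{\partial}{\partial\xi}\ , \quad \hat p_\xi^2 = -\frac{\partial^2}{\partial\xi^2} = -\frac{\hbar}{m\omega}\frac{\partial^2}{\partial x^2}\ , \qquad (7.69)$$

so that the Hamiltonian becomes

$$\hat H = \tfrac12\hbar\omega\,(\hat p_\xi^2 + \xi^2) = \tfrac12\hbar\omega\Big(\xi^2 - \frac{\partial^2}{\partial\xi^2}\Big)\ . \qquad (7.70)$$

From the relation

$$\hat a\hat a^+ + \hat a^+\hat a = \xi^2 - \frac{\partial^2}{\partial\xi^2}\ ,$$

which is easily verified with (7.58), and by use of the commutation property
(7.64) and definition (7.63), we may infer a simple representation of the
Hamiltonian, namely

$$\hat H = \hbar\omega\,(\hat a^+\hat a + \tfrac12) = \hbar\omega\,(\hat N + \tfrac12)\ . \qquad (7.71)$$

From this we can evaluate the energy eigenvalues

$$\hat H\psi_n = \hbar\omega\,(\hat N + \tfrac12)\psi_n = \hbar\omega\,(n + \tfrac12)\psi_n = E_n\psi_n\ . \qquad (7.72)$$

The energy eigenvalues are $E_n = \hbar\omega(n + \frac12)$, as calculated above.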
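The step from the differential form of the Hamiltonian to the operator form can be made explicit by letting the products of $\hat a = (\xi + \partial/\partial\xi)/\sqrt2$ and $\hat a^+ = (\xi - \partial/\partial\xi)/\sqrt2$ act on a test function. The function $f(\xi)$ below is introduced only for this illustration.

```latex
\begin{align*}
\hat a^{+}\hat a\,f &= \tfrac12\Big(\xi-\frac{\partial}{\partial\xi}\Big)\Big(\xi f+\frac{\partial f}{\partial\xi}\Big)
  = \tfrac12\Big(\xi^2 f-\frac{\partial^2 f}{\partial\xi^2}-f\Big),\\
\hat a\,\hat a^{+} f &= \tfrac12\Big(\xi+\frac{\partial}{\partial\xi}\Big)\Big(\xi f-\frac{\partial f}{\partial\xi}\Big)
  = \tfrac12\Big(\xi^2 f-\frac{\partial^2 f}{\partial\xi^2}+f\Big),\\
\hat a\hat a^{+}-\hat a^{+}\hat a &= 1\,,\qquad
\hat a\hat a^{+}+\hat a^{+}\hat a=\xi^2-\frac{\partial^2}{\partial\xi^2}\,,\\
\hat H &= \tfrac12\hbar\omega\,\big(\hat a\hat a^{+}+\hat a^{+}\hat a\big)
  = \hbar\omega\,\big(\hat a^{+}\hat a+\tfrac12\big)\,.
\end{align*}
```

The term $\mp f$ in the first two lines comes from differentiating the product $\xi f$; it is the origin of both the commutator and the zero-point energy.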


7.5 Interpretation of â and â⁺


The ground state $\psi_0$ has the zero-point energy $E_0 = \hbar\omega/2$. Since the energy
spectrum of the harmonic oscillator is equidistant, the state $\psi_n$ possesses an
energy value that is larger by the term $n\hbar\omega$. We will distribute this energy to
$n$ energy quanta $\hbar\omega$ (quanta of the oscillator field), called phonons. $\psi_n$ is called
an $n$-phonon state. In Dirac’s notation it reads

$$\psi_n = |n\rangle\ . \qquad (7.73)$$

The “kets” $|n\rangle$ contain the number of phonons. The zero-phonon state $|0\rangle$ is
also called the vacuum. Using the notation above, the equations (7.59) become

$$\hat a\,|n\rangle = \sqrt n\,|n-1\rangle\ , \quad \hat a^+|n\rangle = \sqrt{n+1}\,|n+1\rangle\ . \qquad (7.74)$$

The following interpretation is appropriate: if acting on the wave function, the
operator $\hat a$ annihilates one phonon at a time, whereas $\hat a^+$ creates one. Therefore,
from now on we will call $\hat a$ and $\hat a^+$ the annihilation operator and creation
operator, respectively. $\hat N$ is termed the phonon number operator, since its
eigenvalues, given by the equation

$$\hat N\,|n\rangle = n\,|n\rangle\ , \qquad (7.75)$$


are the numbers of phonons of the corresponding state. 
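In the phonon basis the operators become matrices. The following sketch uses a truncated basis $|0\rangle, \ldots, |{\rm DIM}-1\rangle$; the truncation size and the variable names are illustrative assumptions. It builds the matrices of the annihilation and creation operators and evaluates the number operator and the commutator.

```python
import math

DIM = 6  # number of basis states kept, |0> ... |DIM-1> (illustrative choice)

def matmul(u, v):
    """Product of two DIM x DIM matrices stored as nested lists."""
    return [[sum(u[i][k] * v[k][j] for k in range(DIM)) for j in range(DIM)]
            for i in range(DIM)]

# <m| a |n> = sqrt(n) for m = n - 1; the creation operator is the transpose
a = [[math.sqrt(n) if m == n - 1 else 0.0 for n in range(DIM)] for m in range(DIM)]
a_dag = [[a[n][m] for n in range(DIM)] for m in range(DIM)]

number = matmul(a_dag, a)   # N = a^+ a, diagonal with entries 0, 1, 2, ...
aa_dag = matmul(a, a_dag)
commutator = [[aa_dag[i][j] - number[i][j] for j in range(DIM)] for i in range(DIM)]

# Energies in units of hbar*omega: E_n = n + 1/2
energies = [number[n][n] + 0.5 for n in range(DIM)]
print([round(e, 10) for e in energies])
print([round(commutator[i][i], 10) for i in range(DIM)])
```

The commutator equals the unit matrix except for its last diagonal element, which is an artifact of cutting off the infinite basis and moves to higher $n$ as `DIM` grows.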


[Fig. 7.4, panel labels: $\hat a\,|3\rangle$: annihilation of a phonon; $(\hat a^+)^2|1\rangle$: creation of two phonons]


The introduction of the phonon representation is often referred to (somewhat
imprecisely) as second quantization. The quanta of the wave field of the
oscillator are exactly the phonons. This becomes clear if we consider the analogy
with photons.

However, for a complete mathematical treatment of the second quantization
of the electromagnetic field, it is necessary to use quantum field theoretical
methods.⁵


EXAMPLE 


7.3 The Three-Dimensional Harmonic Oscillator 


Problem. Determine the eigenvalues and eigenfunctions of the Hamiltonian for 
the three-dimensional, spherically symmetric harmonic oscillator. 


Solution. The Hamiltonian for the three-dimensional, spherically symmetric 
harmonic oscillator is of the form 


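$$\hat H = -\frac{\hbar^2}{2m}\Delta + \frac m2\,\omega^2r^2\ . \qquad (1)$$

Because of the symmetry of the problem, we solve the stationary Schrödinger
equation

$$\hat H\psi_{nlm} = E_{nl}\,\psi_{nlm} \qquad (2)$$

in spherical coordinates. $n$, $l$ are quantum numbers characterizing the
eigenfunctions and will have to be further specified. The Laplacian in spherical
coordinates is

$$\Delta = \frac{\partial^2}{\partial r^2} + \frac2r\frac{\partial}{\partial r} - \frac{\hat L^2}{\hbar^2r^2}\ , \qquad (3)$$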


where the angular-momentum operator $\hat L$ contains the derivatives with respect
to the angles $\vartheta$ and $\varphi$ [see (4.75-77) and (4.82)]. The eigenfunctions of $\hat L^2$ are the
spherical harmonics [see (4.76-80) and Example 4.9]:

$$\hat L^2\,Y_{lm}(\vartheta,\varphi) = \hbar^2\,l(l+1)\,Y_{lm}(\vartheta,\varphi)\ . \qquad (4)$$

⁵ See W. Greiner, J. Reinhardt: Quantum Electrodynamics, 2nd corr. ed. (Springer, Berlin,
Heidelberg, New York 1994).

Fig. 7.4. Energy levels of the harmonic oscillator and the effect of the creation and
annihilation operators
To separate the angular and radial parts of the wave function $\psi_{nlm}$, we try
writing

$$\psi_{nlm}(r,\vartheta,\varphi) = \frac{R_{nl}(r)}{r}\,Y_{lm}(\vartheta,\varphi)\ . \qquad (5)$$

Substituting (5) into the Schrödinger equation (2) and using (1) yields a
differential equation for the radial part of the wave function $R_{nl}(r)$:

$$R_{nl}'' + \Big(\frac{2mE}{\hbar^2} - \frac{m^2\omega^2}{\hbar^2}\,r^2 - \frac{l(l+1)}{r^2}\Big)R_{nl}(r) = 0\ . \qquad (6)$$

Using the abbreviations in (7.5), this differential equation becomes identical with
(7.6), except for the angular-momentum term $l(l+1)/r^2$, usually termed the
angular-momentum barrier,

$$R_{nl}'' + \Big(k^2 - \lambda^2r^2 - \frac{l(l+1)}{r^2}\Big)R_{nl} = 0\ . \qquad (7)$$

This differential equation can be transformed by a suitable substitution for
$R_{nl}$ to the same standard form (7.8) as previously (7.6).

This substitution will differ from (7.10) because of the angular-momentum
barrier. As before, in the case of the linear oscillator, we try to split off the
asymptotic behaviour of the wave function. If $r\to\infty$, we may neglect the
angular-momentum term $l(l+1)/r^2$. Then the solution of the differential
equation has to behave like

$$R_{nl}(r) \xrightarrow[r\to\infty]{} \sim\exp[-(\lambda/2)\,r^2]\ . \qquad (8)$$

At $r = 0$, the angular-momentum term becomes dominant, independent of the
potential. Thus we try an expansion in a power series

$$R_{nl} = r^\alpha\sum_{i=0}^\infty a_ir^i\ .$$

Substitution into the asymptotic differential equation

$$R_{nl}'' - \frac{l(l+1)}{r^2}\,R_{nl} = 0 \qquad (9)$$

yields

$$\alpha(\alpha-1) - l(l+1) = 0\ ,$$

with solutions $\alpha_1 = -l$, $\alpha_2 = l+1$. Hence, we get

$$R_{nl}(r) \xrightarrow[r\to0]{} r^{l+1} \quad\text{or}\quad r^{-l}\ . \qquad (10)$$


The first possibility in (10) suggests the substitution

$$R_{nl}(r) = r^{l+1}\exp[-(\lambda/2)\,r^2]\,v(r)\ . \qquad (11)$$

Notice that, quite analogously, we could continue the calculation using

$$R_{nl}(r) = r^{-l}\exp[-(\lambda/2)\,r^2]\,u(r)\ .$$

However, this leads to exactly the same solutions as (11). This is not immediately
obvious, but becomes clear by repeating the following steps with this alternative
substitution in place of (11).⁶ With (11), (7) changes to

$$v'' + 2\Big(\frac{l+1}{r} - \lambda r\Big)v' - [\lambda(2l+3) - k^2]\,v = 0\ . \qquad (12)$$

By substituting the variable

$$t = \lambda r^2\ , \qquad (13)$$

(12) transforms into a Kummer differential equation [see (17) of Exercise 7.1]:

$$t\,\frac{d^2v}{dt^2} + \Big(l + \frac32 - t\Big)\frac{dv}{dt} - \Big[\frac12\Big(l + \frac32\Big) - \frac\kappa2\Big]v = 0 \qquad (14)$$

with $\kappa = k^2/2\lambda = \hbar k^2/2m\omega = E/\hbar\omega$ [see (7.9)]. It has the solutions

$$v(r) = C_1\,{}_1F_1\big[\tfrac12(l + \tfrac32 - \kappa),\ l + \tfrac32;\ \lambda r^2\big]
+ C_2\,r^{-(2l+1)}\,{}_1F_1\big[\tfrac12(-l + \tfrac12 - \kappa),\ -l + \tfrac12;\ \lambda r^2\big]\ . \qquad (15)$$


When $l \ne 0$, the second particular solution cannot be normalized, since it
diverges too strongly at $r = 0$. Therefore we set $C_2 = 0$. The same is also valid
for $l = 0$. To prove this is not trivial, so we will now demonstrate the verification
in detail. (In the case of the linear oscillator, the second particular solution
would make sense from a physical point of view. The difference between the
cases originates from the fact that, earlier, the normalization integral was
one-dimensional, whereas, now, it is three-dimensional; i.e. it has to be
integrated with a different volume element.)

We start by requiring that the momentum operator $-i\hbar\nabla$ be self-adjoint,
since it represents a physical quantity and therefore should have real
eigenvalues:

$$\int u_n^*(-i\hbar\nabla)\,u_m\,d\tau = \int(-i\hbar\nabla u_n)^*\,u_m\,d\tau\ , \qquad (16)$$

where $u_n$, $u_m$ are elements of a complete orthonormal set of solutions that belong
to a certain Hamiltonian, e.g. to the Hamiltonian (1):

$$\int u_n^*u_m\,d\tau = \delta_{nm}\ . \qquad (17)$$


⁶ See J.M. Eisenberg, W. Greiner: Nuclear Theory 1, Nuclear Models, 3rd ed. (North-
Holland, Amsterdam 1987) pp. 145-146.

Since the $u_n$ form a complete set, we may expand the components of $-i\hbar\nabla u_m$ in
terms of the $u_k$:

$$-i\hbar\nabla u_m = \sum_k\boldsymbol a_{mk}\,u_k\ . \qquad (18)$$

Then from (16) we can obtain

$$\int u_n^*(-i\hbar\nabla)\cdot(-i\hbar\nabla u_m)\,d\tau = \int(-i\hbar\nabla u_n)^*\cdot(-i\hbar\nabla u_m)\,d\tau\ , \qquad (19)$$

or

$$\int u_n^*\,\Delta u_m\,d\tau = -\int\nabla u_n^*\cdot\nabla u_m\,d\tau\ , \qquad (20)$$

employing Green’s theorem, which requires that the surface integral vanish. If we
multiply (20) by $-\hbar^2/2m$ and set $n = m$, the left-hand side becomes the mean
value of kinetic energy of the state $u_n$:

$$\langle E_{\rm kin}\rangle = \frac{\hbar^2}{2m}\int|\nabla u_n|^2\,d\tau\ . \qquad (21)$$

The second particular solution in (15) with $l = 0$ behaves like

$$\frac{R(r)}{r} = \frac1r\,{}_1F_1\big[\tfrac12(\tfrac12 - \kappa),\ \tfrac12;\ \lambda r^2\big]\,e^{-(\lambda/2)r^2}\ , \qquad (22)$$

which means that the integral (21) diverges, whereas the expectation value of the
potential energy $m\omega^2r^2/2$ remains finite (this straightforward calculation is left
to the reader as an exercise). The reason for the divergence of the integral of
kinetic energy is the divergence of the wave function at the origin $r = 0$. There it
holds that

$$\nabla\frac1r \sim \frac{1}{r^2} \quad (r\to0)$$

and therefore

$$\int\Big(\nabla\frac1r\Big)^2r^2\,dr \sim \int\frac{dr}{r^2}\ .$$

This term obviously diverges. Consequently, we also have to rule out solution
(22).

We now return to (15). Employing the same arguments as in the case of the
linear oscillator, we conclude that, since the solutions are required to be regular
at infinity, the hypergeometric function has to break off, leading this time to the
condition

$$\tfrac12\big(l + \tfrac32 - \kappa\big) = -n\ , \quad n\in\mathbb N_0\ ,$$

i.e. to a quantization of energy:

$$E_{nl} = \hbar\omega\,\big(2n + l + \tfrac32\big)\ . \qquad (23)$$


The term $(3/2)\hbar\omega$ represents the zero-point energy of the three-dimensional
harmonic oscillator. Since there are now zero-point oscillations along the x-, y-
and z-axes, the zero-point energy is three times as large as in the one-dimensional
case.

The complete, not yet normalized, eigenfunction belonging to the eigenvalue
$E_{nl}$ is then

$$\psi_{nlm}(r,\vartheta,\varphi) = r^l\,e^{-(\lambda/2)r^2}\,{}_1F_1\big(-n,\ l + \tfrac32;\ \lambda r^2\big)\,Y_{lm}(\vartheta,\varphi)\ . \qquad (24)$$

The $2l+1$ eigenstates with the same $(n, l)$ but different magnetic quantum
numbers $m$ are degenerate. Furthermore, the states with constant $N = 2n + l$
are degenerate, too. Therefore $N$ is sometimes called the principal quantum
number. The following table shows the degeneracy of the lowest eigenstates of
the three-dimensional oscillator.

In spectroscopy, it is common to use the notation s, p, d, f, ... for the angular
momenta $l = 0, 1, 2, 3, \ldots$ For example, a 5p state is characterized by the
principal quantum number $N = 5$ and angular momentum $l = 1$ (see following
table).


Spectrum of the three-dimensional harmonic oscillator


N = 2n + l   n   l       Energy       States per (n, l)   Number of degenerate states
0            0   0 (s)   (3/2) ħω     1                   1
1            0   1 (p)   (5/2) ħω     3                   3
2            1   0 (s)   (7/2) ħω     1                   6
             0   2 (d)                5
3            1   1 (p)   (9/2) ħω     3                   10
             0   3 (f)                7
4            2   0 (s)   (11/2) ħω    1                   15
             1   2 (d)                5
             0   4 (g)                9

The diagram below illustrates the spectrum of the three-dimensional harmonic
oscillator. The three-dimensional oscillator is of fundamental importance
for nuclear physics in the so-called shell model for nuclei, when establishing the
corresponding Hamiltonian. In fact, it is an essential part of it: the shell model of
the nucleus is based on the assumption that the individual nucleons move in an
average potential that is generated by all the nucleons together.

[Figure: The level structure of a particle in a three-dimensional oscillator
potential. Notice the degeneracy of the higher states]

This average potential is often approximated by the three-dimensional
oscillator.⁷ In addition, there is a so-called spin-orbit interaction, with which we
will become familiar when discussing relativistic quantum mechanics.


7.6 Biographical Notes 


WEBER, Heinrich, German mathematician, *5.3.1842 Heidelberg, †17.5.1913
Strasbourg. W. studied in Heidelberg, Leipzig and Königsberg. In 1873 he was professor in
Berlin, 1884 in Marburg, 1892 in Göttingen and from 1895 on, in Strasbourg. W. made
significant contributions to mathematical physics, number theory and algebra. He was
also co-author of excellent text books.


GAUSS, Carl Friedrich, German mathematician, astronomer and physicist, *30.4.1777
Braunschweig (Brunswick), †23.2.1855 Göttingen. G. was a day-labourer’s son and
attracted attention very early on because of his extraordinary mathematical talents.
Beginning in 1791, he was educated at the expense of the Duke of Braunschweig
(Brunswick). G. studied 1795-98 in Göttingen and took his doctorate in 1799 in
Helmstedt. From 1807 on, G. was director of the Göttingen observatory and professor at
the University of Göttingen. He refused all offers to go elsewhere, for example, to join the
academy in Berlin. G. started his scientific activities in 1791 with investigations concern- 
ing the geometric-arithmetic mean, the distribution of prime numbers, and in 1792, the 
foundations of geometry. By 1794 G. had discovered the method of least squares, and in 
1795 he began to work intensely on the theory of numbers, for example, on the quadratic 
reciprocity law. In 1796, G. published his first work; in it, he furnished proof that
regular n-gons, in particular the 17-gon, can be constructed with a compass and
straightedge, if n is a Fermat-type prime number. In his doctoral thesis (1799), G.
furnished proof of the fundamental theorem of algebra, which he followed with further
proofs. From his bequest we know that in the same year G. had already developed the
foundations of the theory of elliptic and modular functions. His first extensive work,
published in 1801, is his famous Disquisitiones arithmeticae, which marks the beginning
of modern number theory. It contains the theory of quadratic congruences and the first 
proof of the quadratic reciprocity law, the theorema aureum, as well as the science of circle 
partitions. From about 1801 on, G. became interested in astronomy. The results of his 
studies were the calculation of the planetoid Ceres’s orbit (1801), research into secular 
perturbations (1809 and 1818), and the attraction of the universal ellipsoid (1813). In 
1812, G. published his treatise on the hypergeometric series, which is the first correct and
systematic investigation of convergence. From 1820 on, G. devoted himself increasingly
to geodesy. His most outstanding theoretical achievement was the theory of surfaces,
which contains the Theorema egregium (1827). G. practised geodesy, too, e.g. he
performed comprehensive surveys in the years 1821-25. In spite of such extensive
activities, in 1825 and in 1831 his publications on biquadratic residues also appeared. The
second of these treatises offers a description of complex numbers in the plane and a novel
theory of prime numbers. In his last years G. also enjoyed physical problems. His most
important contributions are the invention of the electric telegraph, made in 1833/34
together with W. Weber, and in 1839/40 the potential theory, which became a new branch
of mathematics. Many of G.’s important results only became known from his diary and
letters; for example, G. had discovered non-Euclidean geometry by 1816. G.’s reluctance
to publish important results originated in the extraordinarily high standards he set for
the presentation of his research, and in his efforts to avoid unnecessary argument.

⁷ See J.M. Eisenberg, W. Greiner: Nuclear Theory 1, Nuclear Models, 3rd ed. (North-
Holland, Amsterdam 1987).


KUMMER, Ernst Eduard, German mathematician, *29.1.1810 Sorau (Żary),
†14.5.1893 Berlin. K. was a teacher at the grammar school in Liegnitz from 1832 to 1842;
later he was at the University of Breslau (Wroclaw) until 1856. In the following years, 
until 1883, he was a professor at the University of Berlin. His major mathematical 
achievements are the differential geometry of congruences and the introduction of the 
ideal numbers to the theory of algebraic number fields. 


LAGUERRE, Edmond Nicolas, French mathematician, *9.4.1834, †13.8.1886
Bar-le-Duc. L. was one of the founders of modern geometry. He became a member of the
Académie Française in 1885. Besides geometrical problems (especially the interpretation
of imaginary and real geometry), L. particularly furthered the theories of algebraic
equations and continued fractions.


GREEN, George, English mathematician, *14.7.1793 at Nottingham, †31.3.1841 at
Sneinton near Nottingham. Besides his activities as successor to his father who was
a baker and miller, G. followed closely all discoveries concerning electricity and read the
works of Laplace. After further studies at Cambridge, he worked there at Caius College.
His main work “An Essay on the Application of Mathematical Analysis to the Theories of
Electricity and Magnetism” (1828) represents the first attempt at a mathematical
description of the phenomena of electricity, and marks, along with the work of Gauss, the
beginning of potential theory.


8. The Transition from Classical
to Quantum Mechanics


8.1 Motion of the Mean Values 


We consider a Hermitian operator $\hat L$. The mean value of the operator is, as we
know, defined as

$$\bar L = \int\psi^*\hat L\,\psi\,dV\ . \qquad (8.1)$$

Since both the operator $\hat L$ and the wave function $\psi$ can be time dependent, the
mean value $\bar L$ will in general be time dependent, too. When we evaluate the
temporal variation of $\bar L$, we can exchange differentiation and integration. This
yields

$$\frac{d}{dt}\bar L = \int\psi^*\frac{\partial\hat L}{\partial t}\psi\,dV
+ \int\Big(\frac{\partial\psi^*}{\partial t}\hat L\psi + \psi^*\hat L\frac{\partial\psi}{\partial t}\Big)dV\ . \qquad (8.2)$$

The first integral represents the mean value of the partial temporal derivative
of the operator $\hat L$. The second integral can be simplified with the aid of the
time-dependent Schrödinger equation:

$$\frac{\partial\psi}{\partial t} = -\frac i\hbar\hat H\psi\ , \quad \frac{\partial\psi^*}{\partial t} = \frac i\hbar\hat H^*\psi^*\ . \qquad (8.3)$$

If we make use of the Hermiticity of $\hat H$ we obtain

$$\frac{d}{dt}\bar L = \int\psi^*\frac{\partial\hat L}{\partial t}\psi\,dV + \frac i\hbar\int\psi^*[\hat H,\hat L]_-\psi\,dV \qquad (8.4)$$

or, more simply,

$$\frac{d}{dt}\bar L = \overline{\frac{\partial\hat L}{\partial t}} + \frac i\hbar\,\overline{[\hat H,\hat L]_-}\ . \qquad (8.5)$$

Taking (8.5) as a basis, we can neatly define the total temporal derivative of the
operator $d\hat L/dt$:

$$\frac{d\hat L}{dt} = \frac{\partial\hat L}{\partial t} + \frac i\hbar\,[\hat H,\hat L]_-\ . \qquad (8.6)$$

From this definition, we see that the temporal derivative of the mean value $\bar L$
is equal to the mean value of $d\hat L/dt$. The sum and product obey the usual
differentiation rules:


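$$\frac{d}{dt}(\hat A + \hat B) = \frac{d\hat A}{dt} + \frac{d\hat B}{dt}\ , \qquad (8.7)$$

$$\frac{d}{dt}(\hat A\hat B) = \frac{d\hat A}{dt}\,\hat B + \hat A\,\frac{d\hat B}{dt}\ , \qquad (8.8)$$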


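which can be seen by applying (8.6). The proof of (8.8) is as follows:

$$\begin{aligned}
\frac{d}{dt}(\hat A\hat B) &= \frac{\partial}{\partial t}(\hat A\hat B) + \frac i\hbar\,[\hat H,\hat A\hat B]_- \\
&= \frac{\partial\hat A}{\partial t}\hat B + \hat A\frac{\partial\hat B}{\partial t} + \frac i\hbar\,(\hat H\hat A\hat B - \hat A\hat B\hat H) \\
&= \frac{\partial\hat A}{\partial t}\hat B + \hat A\frac{\partial\hat B}{\partial t} + \frac i\hbar\,\big([\hat H,\hat A]_-\hat B + \hat A\,[\hat H,\hat B]_-\big) \\
&= \frac{d\hat A}{dt}\,\hat B + \hat A\,\frac{d\hat B}{dt}\ . \qquad (8.9)
\end{aligned}$$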


8.2 Ehrenfest’s Theorem 


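We consider the time derivative of the coordinate (or momentum) operator.
Neither operator is explicitly time dependent; hence, the following is valid for
the x components:

$$\frac{d\hat x}{dt} = \frac i\hbar\,[\hat H,\hat x]_-\ , \qquad (8.10)$$

$$\frac{d\hat p_x}{dt} = \frac i\hbar\,[\hat H,\hat p_x]_-\ . \qquad (8.11)$$

For the other components, analogous expressions are valid. To evaluate the
commutators, let us have a look at the Hamiltonian of a particle in the potential:

$$\hat H = \frac{1}{2m}\,(\hat p_x^2 + \hat p_y^2 + \hat p_z^2) + V(x,y,z)\ . \qquad (8.12)$$

The operator $\hat x$ commutes with $\hat p_y^2$, $\hat p_z^2$ and the potential, which is supposed to be
an exclusively space-dependent function. Thus

$$[\hat H,\hat x]_- = \frac{1}{2m}\,[\hat p_x^2,\hat x]_- = \frac{1}{2m}\,(\hat p_x^2\hat x - \hat x\hat p_x^2)
= \frac{1}{2m}\,\big(\hat p_x[\hat p_x,\hat x]_- + [\hat p_x,\hat x]_-\hat p_x\big) = \frac\hbar i\,\frac{\hat p_x}{m}\ . \qquad (8.13)$$

The commutator with the component of momentum $\hat p_x$ yields

$$[\hat H,\hat p_x]_- = [V(x,y,z),\hat p_x]_- = -\frac\hbar i\,\frac{\partial V}{\partial x}\ . \qquad (8.14)$$

Hence, from (8.10) and (8.11) we get

$$\frac{d\hat x}{dt} = \frac{\hat p_x}{m}\ , \qquad (8.15)$$

$$\frac{d\hat p_x}{dt} = -\frac{\partial V}{\partial x}\ . \qquad (8.16)$$

Thus the same relations exist between the operators of position and momentum
as between the coordinates of position and momentum in classical mechanics:

$$\frac{dx}{dt} = \frac{p_x}{m}\ , \quad \frac{dp_x}{dt} = -\frac{\partial V}{\partial x}\ . \qquad (8.17)$$

Evaluating the mean values of (8.15) and (8.16) and considering $\overline{d\hat x/dt} = d\bar x/dt$,
both of Ehrenfest’s theorems follow

$$\frac{d}{dt}\int\psi^*x\,\psi\,dx = \frac1m\int\psi^*\hat p_x\psi\,dx\ , \quad
\frac{d}{dt}\int\psi^*\hat p_x\psi\,dx = -\int\psi^*\frac{\partial V}{\partial x}\psi\,dx\ . \qquad (8.18)$$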


This statement is summed up in Ehrenfest’s Theorem (1927): The mean values of 
quantum-mechanical quantities move according to classical equations. 
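The operator relations behind Ehrenfest’s theorems can be checked with matrices for the harmonic oscillator, where $V = m\omega^2x^2/2$. The sketch below assumes the representations $\hat x = \sqrt{\hbar/2m\omega}\,(\hat a + \hat a^+)$ and $\hat p = i\sqrt{m\omega\hbar/2}\,(\hat a^+ - \hat a)$, which follow from the definitions of $\hat a$ and $\hat a^+$ in Chap. 7; the truncation size and the units are illustrative choices.

```python
import math

DIM = 8                  # size of the truncated basis |0>, ..., |DIM-1> (illustrative)
HBAR = M = OMEGA = 1.0   # illustrative units

def matmul(u, v):
    return [[sum(u[i][k] * v[k][j] for k in range(DIM)) for j in range(DIM)]
            for i in range(DIM)]

def commutator(u, v):
    uv, vu = matmul(u, v), matmul(v, u)
    return [[uv[i][j] - vu[i][j] for j in range(DIM)] for i in range(DIM)]

# <m| a |n> = sqrt(n) for m = n - 1; a_dag is the transpose
a = [[math.sqrt(n) if m == n - 1 else 0.0 for n in range(DIM)] for m in range(DIM)]
a_dag = [[a[n][m] for n in range(DIM)] for m in range(DIM)]

x0 = math.sqrt(HBAR / (2.0 * M * OMEGA))
p0 = math.sqrt(M * OMEGA * HBAR / 2.0)
x = [[x0 * (a[i][j] + a_dag[i][j]) for j in range(DIM)] for i in range(DIM)]
p = [[1j * p0 * (a_dag[i][j] - a[i][j]) for j in range(DIM)] for i in range(DIM)]
h = [[HBAR * OMEGA * (i + 0.5) if i == j else 0.0 for j in range(DIM)] for i in range(DIM)]

hx, hp = commutator(h, x), commutator(h, p)
# dx/dt = (i/hbar)[H, x] should equal p/m; dp/dt = (i/hbar)[H, p] should equal -m*omega^2*x
dev_x = max(abs(1j / HBAR * hx[i][j] - p[i][j] / M) for i in range(DIM) for j in range(DIM))
dev_p = max(abs(1j / HBAR * hp[i][j] + M * OMEGA ** 2 * x[i][j])
            for i in range(DIM) for j in range(DIM))
print(dev_x, dev_p)
```

Because the Hamiltonian is diagonal in this basis, both relations hold exactly even in the truncated space.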


8.3 Constants of Motion, Laws of Conservation 


An operator that is not explicitly time dependent is a constant of motion if it commutes
with the Hamiltonian. Indeed, a constant of motion requires

$$\frac{d\hat L}{dt} = \frac{\partial\hat L}{\partial t} + \frac i\hbar\,[\hat H,\hat L]_- = 0\ . \qquad (8.19)$$

If the operator itself is not explicitly time dependent, i.e. $\partial\hat L/\partial t = 0$, it follows
that $[\hat H,\hat L]_- = 0$. Thus only those operators $\hat L$ represent constants of motion
which 1) are not explicitly time dependent and 2) commute with the
Hamiltonian. This fact will be very important in our further studies of quantum
mechanics.

The operator $\hat H$ of the total energy obviously commutes with itself. It is
therefore a constant of motion exactly when it is not explicitly time dependent. This
is just the law of conservation of energy.

The momentum is not explicitly time dependent. With respect to (8.14), it
immediately follows that $\hat p_x$ = constant if $\partial V/\partial x = 0$. Thus the law of
conservation of momentum is valid in quantum mechanics, too.

For central forces, the associated potential $V(r)$ is only a function of the
radius $r$. The angular-momentum operator $\hat L^2 = -\hbar^2\nabla^2_{\vartheta,\varphi}$ [see (4.75)] thus
commutes with $V(r)$. The complete Hamiltonian is [see (4.82a)]

$$\hat H = -\frac{\hbar^2}{2m}\Big(\frac{\partial^2}{\partial r^2} + \frac2r\frac{\partial}{\partial r}\Big) + \frac{\hat L^2}{2mr^2} + V(r)\ ; \qquad (8.20)$$

hence,

$$\frac{d\hat L^2}{dt} = \frac i\hbar\,[\hat H,\hat L^2]_- = 0\ . \qquad (8.21)$$

Thereby, the law of conservation of angular momentum is valid (Kepler’s second
law: the area theorem). The same consideration is true for the z component of
angular momentum, because $[\hat L^2,\hat L_z]_- = 0$ and $[\hat H,\hat L_z]_- = 0$.


EXERCISE


8.1 Commutation Relations 


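Problem. Show by application of the canonical commutation relation

$$[\hat p_i,\hat q_k]_- = \frac\hbar i\,\delta_{ik} \qquad (1)$$

that the commutation relations

$$[\hat H,\hat p_i]_- = -\frac\hbar i\,\frac{\partial\hat H}{\partial\hat q_i}\ , \quad
[\hat H,\hat q_i]_- = \frac\hbar i\,\frac{\partial\hat H}{\partial\hat p_i} \qquad (2)$$

are valid for Hamiltonians of the form

$$\hat H = \hat q_i, \hat q_i^2, \ldots, \hat q_i^n, \hat p_i^n \quad\text{and} \qquad (3)$$

$$\hat H = \sum_{k,n}C_{kn}\,\hat p_i^k\hat q_i^n\ . \qquad (4)$$

Solution. The proof is given by complete induction. The equation

$$[\hat q_i^n,\hat p_i]_- = -\frac\hbar i\,\frac{\partial\hat q_i^n}{\partial\hat q_i} \qquad (5)$$

is obviously true for $\hat H = \hat q_i$. We assume it also to be true for $\hat H = \hat q_i^n$. Then, for
$\hat H = \hat q_i^{n+1}$, the following is valid:

$$\begin{aligned}
[\hat q_i^{n+1},\hat p_i]_- &= \hat q_i^{n+1}\hat p_i - \hat p_i\hat q_i^{n+1}
= \hat q_i\,(\hat q_i^n\hat p_i - \hat p_i\hat q_i^n) + (\hat q_i\hat p_i - \hat p_i\hat q_i)\,\hat q_i^n \\
&= -\frac\hbar i\,\hat q_i\,n\,\hat q_i^{n-1} - \frac\hbar i\,\hat q_i^n
= -\frac\hbar i\,(n+1)\,\hat q_i^n = -\frac\hbar i\,\frac{\partial\hat q_i^{n+1}}{\partial\hat q_i}\ ; \qquad (6)
\end{aligned}$$

hence, the relation is valid for all $n$.

The proof for $[\hat H,\hat q_i]_-$ and for $\hat H = \hat p_i^n$ is performed analogously.
If $\hat H = \sum C_{kn}\,\hat p_i^k\hat q_i^n$, then we have

$$[\hat H,\hat p_i]_- = \sum_{k,n}C_{kn}\,\hat p_i^k\,[\hat q_i^n,\hat p_i]_-
= \sum_{k,n}C_{kn}\,\hat p_i^k\Big(-\frac\hbar i\,\frac{\partial\hat q_i^n}{\partial\hat q_i}\Big)
= -\frac\hbar i\,\frac{\partial\hat H}{\partial\hat q_i}\ . \qquad (7)$$

The proof for $[\hat H,\hat q_i]_-$ is performed analogously.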
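The induction result for powers of the coordinate can be tested on polynomials, on which the coordinate operator acts by multiplication with $x$ and the momentum operator as $-i\hbar\,d/dx$. The following sketch works with coefficient lists; the representation, the value $\hbar = 1$ and all function names are assumptions made for illustration.

```python
HBAR = 1.0  # illustrative units

def q_op(poly):
    """Multiply by x; poly[k] is the coefficient of x**k."""
    return [0j] + list(poly)

def p_op(poly):
    """Momentum operator -i*hbar*d/dx acting on a polynomial."""
    return [-1j * HBAR * k * c for k, c in enumerate(poly)][1:] or [0j]

def q_pow(poly, n):
    """Apply the coordinate operator n times."""
    for _ in range(n):
        poly = q_op(poly)
    return poly

def sub(u, v):
    """Difference of two coefficient lists, padded to equal length."""
    size = max(len(u), len(v))
    u = list(u) + [0j] * (size - len(u))
    v = list(v) + [0j] * (size - len(v))
    return [s - t for s, t in zip(u, v)]

def commutator_qn_p(poly, n):
    """[q^n, p] applied to poly, i.e. q^n (p poly) - p (q^n poly)."""
    return sub(q_pow(p_op(poly), n), p_op(q_pow(poly, n)))

def expected(poly, n):
    """-(hbar/i) * n * q^(n-1) applied to poly, i.e. i*hbar*n*x^(n-1)*poly."""
    return [1j * HBAR * n * c for c in q_pow(poly, n - 1)]

f = [1.0, -2.0, 0.5, 3.0]  # arbitrary test polynomial 1 - 2x + 0.5x^2 + 3x^3
residual = max(max(abs(c) for c in sub(commutator_qn_p(f, n), expected(f, n)))
               for n in range(1, 6))
print(residual)
```

A vanishing residual for $n = 1, \ldots, 5$ is of course no substitute for the induction proof; it only confirms the sign and the factor $n$.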


EXERCISE


8.2 The Virial Theorem 


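The virial theorem formulates a general relation between the mean value of the
kinetic energy $\langle|\hat T|\rangle$ and the potential $V$:

$$2\langle|\hat T|\rangle = \langle|\boldsymbol r\cdot\nabla V(\boldsymbol r)|\rangle\ . \qquad (1)$$

It is valid both in classical mechanics and quantum mechanics and can be
proved similarly in both disciplines. In classical mechanics, we start with the
temporal mean value of the time derivative of the quantity $\boldsymbol r\cdot\boldsymbol p$, i.e. $d(\boldsymbol r\cdot\boldsymbol p)/dt$,
which vanishes for periodic motions. Correspondingly, in quantum mechanics
we consider the expectation value of $d(\boldsymbol r\cdot\hat{\boldsymbol p})/dt$ and get

$$\Big\langle\Big|\frac{d}{dt}(\boldsymbol r\cdot\hat{\boldsymbol p})\Big|\Big\rangle = \frac{d}{dt}\langle|\boldsymbol r\cdot\hat{\boldsymbol p}|\rangle
= \frac{1}{i\hbar}\,\langle|[\boldsymbol r\cdot\hat{\boldsymbol p},\hat H]_-|\rangle = 0\ . \qquad (2)$$

The last identity can be verified easily in energy representation:

$$\langle\psi_E|[\boldsymbol r\cdot\hat{\boldsymbol p},\hat H]_-|\psi_E\rangle = \langle\psi_E|\,\boldsymbol r\cdot\hat{\boldsymbol p}\,E - E\,\boldsymbol r\cdot\hat{\boldsymbol p}\,|\psi_E\rangle
= (E - E)\,\langle\psi_E|\boldsymbol r\cdot\hat{\boldsymbol p}|\psi_E\rangle = 0\ . \qquad (3)$$

The last step is based on the Hermiticity of $\hat H$ and the reality of $E$. On the other
hand, the commutator $[\boldsymbol r\cdot\hat{\boldsymbol p},\hat H]_-$ can be easily computed for all $\hat H$ of the form
$\hat H = \hat{\boldsymbol p}^2/2m + V(\boldsymbol r)$:

$$\begin{aligned}
[\boldsymbol r\cdot\hat{\boldsymbol p},\hat H]_- &= \Big[x\hat p_x + y\hat p_y + z\hat p_z,\ \frac{\hat p_x^2 + \hat p_y^2 + \hat p_z^2}{2m} + V(x,y,z)\Big]_- \\
&= \frac{i\hbar}{m}\,(\hat p_x^2 + \hat p_y^2 + \hat p_z^2) - i\hbar\Big(x\frac{\partial V}{\partial x} + y\frac{\partial V}{\partial y} + z\frac{\partial V}{\partial z}\Big) \\
&= 2i\hbar\,\hat T - i\hbar\,(\boldsymbol r\cdot\nabla V)\ . \qquad (4)
\end{aligned}$$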


Thus we have $2\langle|\hat T|\rangle = \langle|\boldsymbol r\cdot\nabla V(\boldsymbol r)|\rangle$. We note that it does not matter for the
proof whether we start from $\boldsymbol r\cdot\hat{\boldsymbol p}$ or $\hat{\boldsymbol p}\cdot\boldsymbol r$, because the difference between the terms
is a constant which obviously commutes with $\hat H$. If $V$ is a spherically symmetric
potential, e.g. $V(r) \sim r^n$, the virial theorem yields $2\langle|\hat T|\rangle = n\langle|V|\rangle$. This is
valid for all $n$, where the expectation value $\langle|V|\rangle$ must exist, of course.
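Two standard cases illustrate the result for power-law potentials, for which $\boldsymbol r\cdot\nabla V = nV$. The Coulomb-type potential with strength $\alpha$ is an example chosen here and does not appear in the text above.

```latex
\begin{align*}
V(r) &= \tfrac12 m\omega^2 r^2 \quad (n = 2): &
2\langle|\hat T|\rangle &= 2\langle|V|\rangle
  \;\Longrightarrow\; \langle|\hat T|\rangle = \langle|V|\rangle = \tfrac12 E\,,\\
V(r) &= -\frac{\alpha}{r} \quad (n = -1): &
2\langle|\hat T|\rangle &= -\langle|V|\rangle
  \;\Longrightarrow\; E = \langle|\hat T|\rangle + \langle|V|\rangle = -\langle|\hat T|\rangle\,.
\end{align*}
```

For the one-dimensional oscillator ground state this gives $\langle|\hat T|\rangle = \langle|V|\rangle = \hbar\omega/4$, in agreement with the zero-point energy $\hbar\omega/2$.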


8.4 Quantization in Curvilinear Coordinates 


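Equation (8.6), which gives us the total time derivative of an operator $\hat F$,

$$\frac{d\hat F}{dt} = \frac{\partial\hat F}{\partial t} + \frac{1}{i\hbar}\,[\hat F,\hat H]_-\ , \qquad (8.22)$$

has a formal analogue in classical mechanics, the Poisson bracket. The total
temporal derivative of a function $F(p_i, q_i, t)$ is given by

$$\frac{dF}{dt} = \frac{\partial F}{\partial t} + \sum_{i=1}^f\Big(\frac{\partial F}{\partial q_i}\,\dot q_i + \frac{\partial F}{\partial p_i}\,\dot p_i\Big)\ . \qquad (8.23)$$

Here, $p_i$, $q_i$ are the generalized momenta and coordinates, respectively, and $f$ is
the number of degrees of freedom. By using Hamilton’s equations, the second
term on the right-hand side of (8.23) transforms to

$$\sum_{i=1}^f\Big(\frac{\partial F}{\partial q_i}\,\dot q_i + \frac{\partial F}{\partial p_i}\,\dot p_i\Big)
= \sum_{i=1}^f\Big(\frac{\partial F}{\partial q_i}\frac{\partial H}{\partial p_i} - \frac{\partial F}{\partial p_i}\frac{\partial H}{\partial q_i}\Big) \equiv \{F, H\}\ ,$$

and, hence, (8.23) becomes

$$\frac{dF}{dt} = \frac{\partial F}{\partial t} + \{F, H\}\ . \qquad (8.24)$$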
The analogy between the classical equation (8.24) and the quantum-mechanical 
equation (8.22) is obvious. The so-defined term {F, H}, involving the Hamil- 
tonian function H, is called the Poisson bracket. The transition from classical 
mechanics to quantum mechanics can obviously be performed by the transition 
to operators and the replacement of the Poisson bracket {,} by the commutator 
(1/ih) [, ]_-. The operator (1/ih) [F, H]_ is also called the quantum-mechanical 
Poisson bracket. 
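The classical bracket is easy to evaluate numerically. The sketch below assumes the sign convention $\{F, H\} = \partial F/\partial q\;\partial H/\partial p - \partial F/\partial p\;\partial H/\partial q$ for a single degree of freedom and approximates the derivatives by central differences; the oscillator parameters and function names are illustrative.

```python
def poisson_bracket(F, G, q, p, eps=1e-5):
    """{F, G} = dF/dq * dG/dp - dF/dp * dG/dq for one degree of freedom."""
    dF_dq = (F(q + eps, p) - F(q - eps, p)) / (2 * eps)
    dF_dp = (F(q, p + eps) - F(q, p - eps)) / (2 * eps)
    dG_dq = (G(q + eps, p) - G(q - eps, p)) / (2 * eps)
    dG_dp = (G(q, p + eps) - G(q, p - eps)) / (2 * eps)
    return dF_dq * dG_dp - dF_dp * dG_dq

M, OMEGA = 2.0, 3.0  # illustrative values

def hamiltonian(q, p):
    """Classical harmonic oscillator, H = p^2/2m + m*omega^2*q^2/2."""
    return p * p / (2 * M) + 0.5 * M * OMEGA ** 2 * q * q

q0, p0 = 0.4, -1.1
qdot = poisson_bracket(lambda q, p: q, hamiltonian, q0, p0)      # Hamilton: p/m
pdot = poisson_bracket(lambda q, p: p, hamiltonian, q0, p0)      # Hamilton: -m*omega^2*q
unit = poisson_bracket(lambda q, p: q, lambda q, p: p, q0, p0)   # fundamental bracket
print(qdot, pdot, unit)
```

With $F = q$ and $F = p$ the bracket reproduces Hamilton’s equations, which are the classical counterparts of the operator equations of motion obtained from the commutator with $\hat H$.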

These considerations of analogy can be continued. In classical mechanics
we work with canonical variables and say a transformation from $q_i$, $p_i$ to $Q_i$, $P_i$
is canonical if $Q_i$ and $P_i$ again fulfil Hamilton's equations. This means the
transition

$$H(p_i, q_i) \to \mathcal H(P_i, Q_i) , \tag{8.25}$$

where $\mathcal H$ stands for the new Hamiltonian function depending on the
coordinates $P_i$ and $Q_i$. The same statement can be expressed by the Poisson bracket;
indeed

$$\{q_i, p_j\} = \delta_{ij} \tag{8.26}$$




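because the latter is equivalent to

$$\sum_k\left(\frac{\partial q_i}{\partial q_k}\frac{\partial p_j}{\partial p_k} - \frac{\partial q_i}{\partial p_k}\frac{\partial p_j}{\partial q_k}\right) = \delta_{ij} . \tag{8.27}$$

The term $(\partial q_i/\partial p_k)(\partial p_j/\partial q_k)$ always vanishes and $(\partial q_i/\partial q_k)(\partial p_j/\partial p_k)$ gives the
identity only for $i = j$, hence $\delta_{ij}$. If we transform to $Q_i$, $P_i$, the transformation is
then only canonical if

$$\{Q_i, P_j\} = \delta_{ij} \tag{8.28}$$

is valid. (This will be further explained in Exercise 8.4.) Furthermore, the
following equations also hold:

$$\{Q_i, Q_j\} = 0 \quad\text{and}\quad \{P_i, P_j\} = 0 . \tag{8.29}$$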
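
The canonical condition on the new variables, {Q, P} = 1 for one degree of freedom, can be checked numerically. The following is a minimal sketch, not part of the original text: the two example transformations and the finite-difference helper `poisson_bracket` are assumptions chosen for illustration. The first map (an action-angle type map) should give a bracket of 1, the second (a simple rescaling of both variables) should not.

```python
import math

def poisson_bracket(f, g, q, p, h=1e-5):
    """Central-difference estimate of {f, g} = df/dq*dg/dp - df/dp*dg/dq at (q, p)."""
    df_dq = (f(q + h, p) - f(q - h, p)) / (2 * h)
    df_dp = (f(q, p + h) - f(q, p - h)) / (2 * h)
    dg_dq = (g(q + h, p) - g(q - h, p)) / (2 * h)
    dg_dp = (g(q, p + h) - g(q, p - h)) / (2 * h)
    return df_dq * dg_dp - df_dp * dg_dq

# Hypothetical example 1: Q = arctan(q/p), P = (q^2 + p^2)/2.
def Q(q, p):
    return math.atan2(q, p)

def P(q, p):
    return 0.5 * (q * q + p * p)

# Hypothetical example 2: rescaling both variables by 2 (bracket becomes 4).
def Q_scaled(q, p):
    return 2.0 * q

def P_scaled(q, p):
    return 2.0 * p

point = (0.7, 1.3)  # arbitrary phase-space point away from the origin
canonical_value = poisson_bracket(Q, P, *point)
scaled_value = poisson_bracket(Q_scaled, P_scaled, *point)
self_bracket = poisson_bracket(Q, Q, *point)
print(canonical_value, scaled_value, self_bracket)
```

Analytically the three brackets are 1, 4 and 0; the printed values agree with these up to the finite-difference error.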


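Proceeding to quantum mechanics, we get the same relation, provided that we
set $\hat p_j = -i\hbar\,\partial/\partial x_j$ in the above-defined quantum-mechanical Poisson bracket:

$$\frac{1}{i\hbar}[x_i, \hat p_j]_- = \frac{1}{i\hbar}\left[x_i, -i\hbar\frac{\partial}{\partial x_j}\right]_- = \delta_{ij} . \tag{8.30}$$

Thus the momentum is replaced by the operator. Likewise, both of the
above-considered relations (8.29) are valid in quantum mechanics:

$$[x_i, x_j]_- = 0 \quad\text{and}\quad [\hat p_i, \hat p_j]_- = 0 . \tag{8.31}$$

Moreover, for the classical Poisson bracket, the following relations are valid:

$$\begin{aligned}
\{A, B\} &= -\{B, A\} , \qquad \{A, C\} = 0 \quad\text{for } C = \text{const} , \\
\{A_1 + A_2, B\} &= \{A_1, B\} + \{A_2, B\} , \\
\{A_1 A_2, B\} &= \{A_1, B\}A_2 + A_1\{A_2, B\} , \\
\{A, \{B, C\}\} &+ \{B, \{C, A\}\} + \{C, \{A, B\}\} = 0 \quad\text{(the Jacobi identity)} .
\end{aligned} \tag{8.32}$$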


It can easily be checked that the quantum-mechanical commutator fulfils the 
same algebraic relations. This fact was first noted by P.A.M. Dirac, who used it 
to show the formal analogy between quantum and Hamiltonian mechanics. 
The transition from classical to quantum mechanics can formally be
achieved by a special canonical transformation with the commutators

$$[\hat p_i, x_j]_- = -i\hbar\,\delta_{ij} , \quad [x_i, x_j]_- = 0 , \quad [\hat p_i, \hat p_j]_- = 0 . \tag{8.33}$$

A short example shows, nevertheless, that one must take care with this
transition, now that the momentum is an operator. The terms

$$p^2 , \quad \frac{1}{x}\,p\,x\,p , \quad \frac{1}{x^2}\,p\,x\,p\,x , \quad\text{etc.} \tag{8.34}$$

are equivalent in classical mechanics. If, however, we replace the momentum by
the operator $\hat p = -i\hbar\,\partial/\partial x$, we get different terms in quantum mechanics.
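
The ordering problem can be made concrete by letting the three classically equal expressions p², (1/x) p x p and (1/x²) p x p x act on a test function. The sketch below is an illustration only, not part of the original text; it assumes units with ħ = 1, uses the test function ψ(x) = x³, and represents a monomial c·xᵏ by the hypothetical pair `(c, k)`.

```python
HBAR = 1.0  # assumption: units with hbar = 1

def p_op(term):
    """Apply p = -i*hbar*d/dx to the monomial c*x**k, stored as (c, k)."""
    c, k = term
    return (-1j * HBAR * c * k, k - 1)

def x_pow(m):
    """Return the operator 'multiply by x**m' acting on (c, k)."""
    def op(term):
        c, k = term
        return (c, k + m)
    return op

def apply_all(ops, term):
    """Apply an operator product to a monomial; the rightmost operator acts first."""
    for op in reversed(ops):
        term = op(term)
    return term

test_function = (1.0, 3)  # psi(x) = x**3

# p^2
ordering_a = apply_all([p_op, p_op], test_function)
# (1/x) p x p
ordering_b = apply_all([x_pow(-1), p_op, x_pow(1), p_op], test_function)
# (1/x^2) p x p x
ordering_c = apply_all([x_pow(-2), p_op, x_pow(1), p_op, x_pow(1)], test_function)

for name, (coeff, power) in [("p^2", ordering_a),
                             ("(1/x) p x p", ordering_b),
                             ("(1/x^2) p x p x", ordering_c)]:
    print(f"{name}: {coeff.real:+.0f} * x^{power}")
```

All three results are proportional to x, but with the different coefficients -6, -9 and -16, so the three orderings are different operators.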
Similar difficulties arise if we use curvilinear coordinates. The special nature
of Cartesian coordinates becomes clear when considering kinetic energy. The
kinetic energy in generalized coordinates is of the form

$$T = \sum_{i,k=1}^{3} m_{ik}(q_1, q_2, q_3)\,p_i p_k . \tag{8.35}$$

The mass coefficients $m_{ik}$ are generally functions of space. Hence, if the
momentum is measured, they cannot simultaneously be exactly determined. Here,
this means that the kinetic energy cannot be determined by measuring only the
momenta. In Cartesian coordinates for a particle of mass m, the coefficients obey
the relation $m_{ik} = (1/2m)\,\delta_{ik}$; thus

$$T = \frac{p_x^2 + p_y^2 + p_z^2}{2m} , \tag{8.36}$$

and the kinetic energy is determined exclusively by the momenta. Of course, the 
form of the kinetic energy is essential in determining the Hamiltonian operator 
(Hamiltonian). To obtain the Hamiltonian operator from the Hamiltonian 
function, it is always necessary to transform the function into Cartesian coordi- 
nates before putting in the operators. This is the safest way to pass from the 
classical to the corresponding quantum-mechanical system. 

We now show the different outcomes of the two procedures, taking a central 
potential as an example. 


Correct Way 


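We consider a central force problem (e.g. the hydrogen atom) with the potential
$V(r) = -e^2/r$. The Hamiltonian function in Cartesian coordinates takes the
form $H = \mathbf p^2/2m + V(r)$. We replace the momentum by the operator $-i\hbar\nabla$ and
obtain the Hamiltonian

$$\hat H = -\frac{\hbar^2}{2m}\Delta - \frac{e^2}{r} . \tag{8.37}$$

Now we must transform the $\Delta$ operator into the spherical coordinates $r$, $\vartheta$, $\varphi$,
which are appropriate to the problem. The outcome of this computation is well
known and leads to the Schrödinger equation

$$\left[-\frac{\hbar^2}{2m}\left(\frac{1}{r^2}\frac{\partial}{\partial r}r^2\frac{\partial}{\partial r} + \frac{1}{r^2\sin\vartheta}\frac{\partial}{\partial\vartheta}\sin\vartheta\frac{\partial}{\partial\vartheta} + \frac{1}{r^2\sin^2\vartheta}\frac{\partial^2}{\partial\varphi^2}\right) - \frac{e^2}{r}\right]\psi = E\psi . \tag{8.38}$$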


Incorrect Way 


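We start from the classical Hamiltonian function and transform it from
Cartesian to spherical coordinates. Thus we obtain the Hamiltonian function:

$$H = \frac{1}{2m}p_r^2 + \frac{1}{2mr^2}p_\vartheta^2 + \frac{1}{2mr^2\sin^2\vartheta}p_\varphi^2 - \frac{e^2}{r} . \tag{8.39}$$

Now, performing the transition to quantum mechanics with the transformation
equations

$$\begin{aligned}
[\hat p_r, r] &= -i\hbar , & \hat p_r &= -i\hbar\frac{\partial}{\partial r} , \\
[\hat p_\vartheta, \vartheta] &= -i\hbar , & \hat p_\vartheta &= -i\hbar\frac{\partial}{\partial\vartheta} , \\
[\hat p_\varphi, \varphi] &= -i\hbar , & \hat p_\varphi &= -i\hbar\frac{\partial}{\partial\varphi} ,
\end{aligned} \tag{8.40}$$

and noting the term for the quantum-mechanical Hamiltonian,

$$\hat H = -\frac{\hbar^2}{2m}\left(\frac{\partial^2}{\partial r^2} + \frac{1}{r^2}\frac{\partial^2}{\partial\vartheta^2} + \frac{1}{r^2\sin^2\vartheta}\frac{\partial^2}{\partial\varphi^2}\right) - \frac{e^2}{r} , \tag{8.41}$$

we recognize that the outcomes of the two procedures differ; in the second case,
we have lost some terms during the transformation.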
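
The practical consequence of the lost terms can be seen on the hydrogen ground state. The sketch below is an illustration, not part of the original text. It assumes atomic units (ħ = m = e = 1) and the unnormalised wave function ψ(r) = exp(-r), and it compares the local energy (Ĥψ)/ψ for the radial operator ∂²/∂r² + (2/r) ∂/∂r of the correct procedure with the bare ∂²/∂r² of the incorrect one. Derivatives are taken by finite differences; the function names are hypothetical.

```python
import math

def psi(r):
    """Hydrogen ground state (unnormalised) in atomic units: exp(-r)."""
    return math.exp(-r)

def d1(f, r, h=1e-4):
    """Central first derivative."""
    return (f(r + h) - f(r - h)) / (2 * h)

def d2(f, r, h=1e-4):
    """Central second derivative."""
    return (f(r + h) - 2 * f(r) + f(r - h)) / (h * h)

def local_energy_correct(r):
    """(H psi)/psi with the full radial Laplacian d2/dr2 + (2/r) d/dr."""
    kinetic = -0.5 * (d2(psi, r) + (2.0 / r) * d1(psi, r))
    return (kinetic - psi(r) / r) / psi(r)

def local_energy_naive(r):
    """(H psi)/psi with d2/dr2 only, i.e. the (2/r) d/dr term is missing."""
    kinetic = -0.5 * d2(psi, r)
    return (kinetic - psi(r) / r) / psi(r)

radii = [0.5, 1.0, 2.0, 4.0]
correct = [local_energy_correct(r) for r in radii]
naive = [local_energy_naive(r) for r in radii]
for r, ec, en in zip(radii, correct, naive):
    print(f"r = {r}: correct {ec:.6f}, naive {en:.6f}")
```

With the correct operator the local energy is the constant -1/2 (the ground-state energy in atomic units), so ψ is an eigenfunction. With the naive operator the result is -1/2 - 1/r, which depends on r, so the same ψ no longer solves the equation.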

The transformations of kinetic energy operators from Cartesian to curvi- 
linear coordinates seem to be very awkward, and there may be cases in which it 
is impossible to write down the kinetic energy in Cartesian coordinates. Thus 
the question of how to proceed in such a case inevitably arises. 

In the following, we explain a method by which, starting from an arbitrary 
system of coordinates, we can obtain the correct term of the quantum-mechan- 
ical Hamiltonian. 

We consider a system of N particles with 3N degrees of freedom. The
Cartesian coordinates of the particles are $x_1, x_2, \ldots, x_{3N}$. The associated
Cartesian momenta are denoted by $p_1, p_2, \ldots, p_{3N}$. Thus the classical Hamiltonian
function is

$$H = \sum_{i=1}^{3N}\frac{p_i^2}{2m} + V(x_1, \ldots, x_{3N}) . \tag{8.42}$$

The kinetic energy occurring therein is of Cartesian structure, i.e. it is diagonal in
the momenta and has constant mass factors 1/2m. On the basis of our experience
with the single-particle system [e.g. the hydrogen atom, see (8.36) and (8.38)],
we are able to transform this kinetic energy into its quantum-mechanical form.
The operator of the kinetic energy is

$$\hat T = \sum_{i=1}^{3N}\frac{\hat p_i^2}{2m} = -\frac{\hbar^2}{2m}\left(\sum_{i=1}^{3N}\frac{\partial^2}{\partial x_i^2}\right) , \tag{8.43}$$

where the 3N-dimensional Laplace operator (Laplacian) is written in brackets.
It should be made clear that this is a model which is indeed justified by our
experience.

Analogously, we now consider curvilinear coordinates $(u_1, u_2, \ldots, u_{3N})$. The
square of a length element in 3N-dimensional space takes the form

$$ds^2 = \sum_{i,k=1}^{3N} g_{ik}\,du_i\,du_k . \tag{8.44}$$
Fig. 8.1a,b. Surface oscillations of a nucleus. At (a), small amplitudes, fewer nucleons participate in the motion than for (b), large amplitudes. The vibrating mass thus becomes amplitude dependent (coordinate dependent)


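The coefficients $g_{ik}(u_j)$ are the elements of the metric tensor; in general they
depend on the coordinates $u_j$. The kinetic energy yields through

$$T = \frac{m}{2}\left(\frac{ds}{dt}\right)^2 \tag{8.45}$$

the relation

$$T = \frac{m}{2}\sum_{i,k} g_{ik}\frac{du_i}{dt}\frac{du_k}{dt} . \tag{8.46}$$

Thus the $g_{ik}(u_j)$ are a kind of mass coefficient. We call the determinant of the
matrix $(g_{ik})$ $\det(g_{ik}) = g$, and the inverse of the matrix $(g_{ik})^{-1} = (g^{ik})$.
In curvilinear coordinates, Laplace's operator $\Delta_{3N}$ is¹

$$\Delta = \frac{1}{\sqrt g}\sum_{i,k=1}^{3N}\frac{\partial}{\partial u_i}\left(\sqrt g\,g^{ik}\frac{\partial}{\partial u_k}\right) . \tag{8.47}$$

Using this form of the Laplace operator we always obtain the correct operator
of the kinetic energy in the transition to quantum mechanics, namely

$$\hat T = -\frac{\hbar^2}{2m}\frac{1}{\sqrt g}\sum_{i,k=1}^{3N}\frac{\partial}{\partial u_i}\left(\sqrt g\,g^{ik}\frac{\partial}{\partial u_k}\right) . \tag{8.48}$$
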
This general method of quantization is especially important in nuclear physics 
for the quantization of collective phenomena.² In the case of a vibrating nucleus,
for example, we are dealing with a system the mass of which (vibrating mass) 
depends on the amplitude of the oscillation. The greater the oscillation, the 
more nucleons participate in the motion (cf. Fig. 8.1). The metric tensor (mass 
tensor) becomes coordinate dependent and the quantization (8.48) is of decisive 
importance. 


EXAMPLE


8.3 The Kinetic-Energy Operator in Spherical Coordinates 


Using the example of spherical coordinates, we once again derive the
kinetic-energy operator (8.38), but now from (8.48), which applies to all coordinates. The
length element in spherical coordinates is

$$ds^2 = dr^2 + r^2\,d\vartheta^2 + r^2\sin^2\vartheta\,d\varphi^2 = \sum_{i,k} g_{ik}\,du_i\,du_k . \tag{1}$$


" See, e.g. M.R. Spiegel: Vector Analysis (Schaum, New York 1959). 


2 See J.M. Eisenberg and W. Greiner: Nuclear Theory I: Nuclear Models, 3rd ed. (North 
Holland, Amsterdam 1987). 
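Thus we obtain for the single elements of the metric tensor

$$g_{rr} = 1 , \quad g_{\vartheta\vartheta} = r^2 , \quad g_{\varphi\varphi} = r^2\sin^2\vartheta , \tag{2}$$

forming the diagonal matrix

$$(g_{ik}) = \begin{pmatrix} 1 & 0 & 0 \\ 0 & r^2 & 0 \\ 0 & 0 & r^2\sin^2\vartheta \end{pmatrix} . \tag{3}$$

The determinant and the inverse matrix are easily evaluated:

$$\det(g_{ik}) = g = r^4\sin^2\vartheta \quad\text{and}\quad (g^{ik}) = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1/r^2 & 0 \\ 0 & 0 & 1/(r^2\sin^2\vartheta) \end{pmatrix} . \tag{4}$$

As all $g^{ik} = 0$ for $i \neq k$, the kinetic energy operator consists of only three terms:

$$\begin{aligned}
\hat T &= -\frac{\hbar^2}{2m}\frac{1}{r^2\sin\vartheta}\left[\frac{\partial}{\partial r}\left(r^2\sin\vartheta\frac{\partial}{\partial r}\right) + \frac{\partial}{\partial\vartheta}\left(\frac{r^2\sin\vartheta}{r^2}\frac{\partial}{\partial\vartheta}\right) + \frac{\partial}{\partial\varphi}\left(\frac{r^2\sin\vartheta}{r^2\sin^2\vartheta}\frac{\partial}{\partial\varphi}\right)\right] \\
&= -\frac{\hbar^2}{2m}\left[\frac{1}{r^2}\frac{\partial}{\partial r}r^2\frac{\partial}{\partial r} + \frac{1}{r^2\sin\vartheta}\frac{\partial}{\partial\vartheta}\sin\vartheta\frac{\partial}{\partial\vartheta} + \frac{1}{r^2\sin^2\vartheta}\frac{\partial^2}{\partial\varphi^2}\right] .
\end{aligned} \tag{5}$$
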

This result agrees with (8.38), which was obtained by first making the transition 
from classical to quantum mechanics in Cartesian coordinates and then per- 
forming the transformation to spherical coordinates. 
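
The general formula for the Laplacian in curvilinear coordinates can also be checked numerically. The following is a minimal sketch under stated assumptions, not part of the original text: it treats only diagonal metrics, uses a simple finite-difference scheme, and the function names are hypothetical. For spherical coordinates it is applied to f = x² = (r sin ϑ cos φ)², whose Cartesian Laplacian is 2, and to the harmonic function 1/r, whose Laplacian vanishes for r > 0.

```python
import math

def laplace_beltrami(f, u, sqrt_g, g_inv_diag, h=1e-3):
    """Finite-difference estimate of (1/sqrt g) * sum_i d/du_i( sqrt g * g^{ii} * df/du_i )
    for a diagonal metric. u is the list of curvilinear coordinates."""
    def shifted(i, delta):
        v = list(u)
        v[i] += delta
        return v

    def weight(v, i):
        return sqrt_g(v) * g_inv_diag(v)[i]

    f0 = f(u)
    total = 0.0
    for i in range(len(u)):
        # fluxes sqrt(g) * g^{ii} * df/du_i evaluated half a step to each side
        flux_plus = weight(shifted(i, +h / 2), i) * (f(shifted(i, +h)) - f0) / h
        flux_minus = weight(shifted(i, -h / 2), i) * (f0 - f(shifted(i, -h))) / h
        total += (flux_plus - flux_minus) / h
    return total / sqrt_g(u)

# Spherical coordinates u = (r, theta, phi): sqrt(g) and the diagonal of (g^{ik})
def sqrt_g(u):
    r, theta, _ = u
    return r * r * math.sin(theta)

def g_inv_diag(u):
    r, theta, _ = u
    return (1.0, 1.0 / r**2, 1.0 / (r * math.sin(theta))**2)

def x_squared(u):
    r, theta, phi = u
    return (r * math.sin(theta) * math.cos(phi))**2

def inverse_r(u):
    return 1.0 / u[0]

point = [1.3, 0.7, 0.4]  # arbitrary point away from the coordinate singularities
lap_x2 = laplace_beltrami(x_squared, point, sqrt_g, g_inv_diag)
lap_inv_r = laplace_beltrami(inverse_r, point, sqrt_g, g_inv_diag)
print(lap_x2, lap_inv_r)
```

The two printed numbers are close to 2 and 0, as expected from the Cartesian Laplacian. Only `sqrt_g` and `g_inv_diag` would need to be replaced to try another orthogonal coordinate system.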


EXERCISE


8.4 Review of Some Useful Relations of Classical Mechanics: 
Lagrange and Poisson Brackets 


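Consider a transformation from a set of space and momentum coordinates $q_i$,
$p_i$ to a new set $Q_i$, $P_i$, where

$$Q_i = Q_i(q_i, p_i, t) , \quad P_i = P_i(q_i, p_i, t) . \tag{1}$$

This transformation is called canonical if a function $\mathcal H(Q_i, P_i, t)$ (Hamiltonian)
exists, such that

$$H(q_i, p_i) \to \mathcal H(Q_i, P_i) . \tag{2}$$

$H = \sum_i p_i\dot q_i - L$ and $\mathcal H = \sum_i P_i\dot Q_i - L'$ are the relations between the
Lagrangian L and H, and L' and $\mathcal H$, respectively. In short,

$$\dot Q_i = \frac{\partial\mathcal H}{\partial P_i} , \quad \dot P_i = -\frac{\partial\mathcal H}{\partial Q_i} . \tag{3}$$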




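Poincaré's Theorem states that the following surface integral is invariant under
canonical transformations:

$$J_1 = \iint_A \sum_i dq_i\,dp_i ;$$

hence, it easily follows that the Lagrange bracket,

$$\{\{u, v\}\}_{q,p} = \sum_i\left(\frac{\partial q_i}{\partial u}\frac{\partial p_i}{\partial v} - \frac{\partial p_i}{\partial u}\frac{\partial q_i}{\partial v}\right) , \tag{4}$$

is invariant under canonical transformation.
The Poisson bracket is defined by:

$$\{u, v\}_{q,p} = \sum_i\left(\frac{\partial u}{\partial q_i}\frac{\partial v}{\partial p_i} - \frac{\partial u}{\partial p_i}\frac{\partial v}{\partial q_i}\right) . \tag{5}$$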


Problem. a) Prove Poincaré's Theorem.

b) Show the invariance of the Lagrange bracket under canonical transformations.

c) Show the invariance of the so-called fundamental Poisson brackets under
arbitrary canonical transformations:

$$\{p_i, p_j\} = 0 , \quad \{q_i, q_j\} = 0 , \quad \{q_i, p_j\} = \delta_{ij} . \tag{6}$$

d) Verify the relation

$$\{F, G\}_{q,p} = \{F, G\}_{Q,P} \tag{7}$$

for two arbitrary functions F and G, i.e. the invariance of the Poisson bracket
under arbitrary canonical transformations.


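Solution. a) The position of a point on a two-dimensional area A in phase space
is completely determined by two parameters, u and v. On this area we can
express the coordinates $q_i$ and $p_i$ as functions of u and v: $q_i = q_i(u, v)$,
$p_i = p_i(u, v)$. With the aid of the Jacobi determinant, the area elements du dv and
$dq_i\,dp_i$ can be transformed into each other; the Jacobi determinant being

$$\frac{\partial(q_i, p_i)}{\partial(u, v)} = \begin{vmatrix} \dfrac{\partial q_i}{\partial u} & \dfrac{\partial p_i}{\partial u} \\[2mm] \dfrac{\partial q_i}{\partial v} & \dfrac{\partial p_i}{\partial v} \end{vmatrix} . \tag{8}$$

The two surface elements are related by

$$dq_i\,dp_i = \frac{\partial(q_i, p_i)}{\partial(u, v)}\,du\,dv , \tag{9}$$

i.e. the statement that $J_1$ has the same value for all canonical transformations,

$$\iint_A \sum_i dq_i\,dp_i = J_1 = \iint_A \sum_k dQ_k\,dP_k , \tag{10}$$

can be expressed with the aid of (9) as

$$\iint_A \sum_i \frac{\partial(q_i, p_i)}{\partial(u, v)}\,du\,dv = \iint_A \sum_k \frac{\partial(Q_k, P_k)}{\partial(u, v)}\,du\,dv . \tag{11}$$

Thus the proof is reduced to the statement

$$\sum_i \frac{\partial(q_i, p_i)}{\partial(u, v)} = \sum_k \frac{\partial(Q_k, P_k)}{\partial(u, v)} , \tag{12}$$

i.e. both Jacobi determinants are identical. Now we consider the canonical
transformation $q, p \to Q, P$ to be achieved by the generating function $F_2(q, P, t)$,
for which

$$p_i = \frac{\partial F_2}{\partial q_i} , \quad Q_i = \frac{\partial F_2}{\partial P_i} , \quad \mathcal H = H + \frac{\partial F_2}{\partial t} \tag{13}$$

is valid.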
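For the calculation of the functional determinant we first need:

$$\frac{\partial p_i}{\partial u} = \frac{\partial}{\partial u}\left(\frac{\partial F_2}{\partial q_i}\right) , \tag{14}$$

where we have used (13). Owing to its definition, $F_2$ depends only on $q_k$ and $P_k$;
here, time acts as a parameter, not a coordinate. With the aid of the total
differential for $F_2$, we get

$$\frac{\partial p_i}{\partial u} = \sum_k \frac{\partial^2 F_2}{\partial q_i\,\partial P_k}\frac{\partial P_k}{\partial u} + \sum_k \frac{\partial^2 F_2}{\partial q_i\,\partial q_k}\frac{\partial q_k}{\partial u} . \tag{15}$$

An analogous result is obtained for $\partial p_i/\partial v$, so that the functional determinant
assumes the form

$$\sum_i \frac{\partial(q_i, p_i)}{\partial(u, v)} = \sum_i \begin{vmatrix} \dfrac{\partial q_i}{\partial u} & \sum_k \dfrac{\partial^2 F_2}{\partial q_i\,\partial P_k}\dfrac{\partial P_k}{\partial u} + \sum_k \dfrac{\partial^2 F_2}{\partial q_i\,\partial q_k}\dfrac{\partial q_k}{\partial u} \\[2mm] \dfrac{\partial q_i}{\partial v} & \sum_k \dfrac{\partial^2 F_2}{\partial q_i\,\partial P_k}\dfrac{\partial P_k}{\partial v} + \sum_k \dfrac{\partial^2 F_2}{\partial q_i\,\partial q_k}\dfrac{\partial q_k}{\partial v} \end{vmatrix} . \tag{16}$$

After application of the rules of calculation for addition and multiplication of
constant factors with determinants, this is transformed to

$$\sum_i \frac{\partial(q_i, p_i)}{\partial(u, v)} = \sum_{i,k} \frac{\partial^2 F_2}{\partial q_i\,\partial q_k} \begin{vmatrix} \dfrac{\partial q_i}{\partial u} & \dfrac{\partial q_k}{\partial u} \\[2mm] \dfrac{\partial q_i}{\partial v} & \dfrac{\partial q_k}{\partial v} \end{vmatrix} + \sum_{i,k} \frac{\partial^2 F_2}{\partial q_i\,\partial P_k} \begin{vmatrix} \dfrac{\partial q_i}{\partial u} & \dfrac{\partial P_k}{\partial u} \\[2mm] \dfrac{\partial q_i}{\partial v} & \dfrac{\partial P_k}{\partial v} \end{vmatrix} . \tag{17}$$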
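It is apparent that the first term is antisymmetric with respect to changing the
indices of summation i and k, because this only involves changing two columns.
Thus the first term vanishes and can be replaced by another vanishing term:

$$\sum_i \frac{\partial(q_i, p_i)}{\partial(u, v)} = \sum_{i,k} \frac{\partial^2 F_2}{\partial P_i\,\partial P_k} \begin{vmatrix} \dfrac{\partial P_i}{\partial u} & \dfrac{\partial P_k}{\partial u} \\[2mm] \dfrac{\partial P_i}{\partial v} & \dfrac{\partial P_k}{\partial v} \end{vmatrix} + \sum_{i,k} \frac{\partial^2 F_2}{\partial q_i\,\partial P_k} \begin{vmatrix} \dfrac{\partial q_i}{\partial u} & \dfrac{\partial P_k}{\partial u} \\[2mm] \dfrac{\partial q_i}{\partial v} & \dfrac{\partial P_k}{\partial v} \end{vmatrix} . \tag{18}$$

If (18) is transformed into the form of (16), the element $a_{11}$ reads

$$\sum_i \frac{\partial^2 F_2}{\partial P_i\,\partial P_k}\frac{\partial P_i}{\partial u} + \sum_i \frac{\partial^2 F_2}{\partial q_i\,\partial P_k}\frac{\partial q_i}{\partial u} = \frac{\partial}{\partial u}\left(\frac{\partial F_2}{\partial P_k}\right) . \tag{19}$$

Here, contrary to (16), we have moved the sum over k in front of the determinant
and the sum over i into the determinant. Because of (13), it holds that

$$\frac{\partial F_2}{\partial P_k} = Q_k , \quad\text{hence,}$$

$$\sum_i \frac{\partial(q_i, p_i)}{\partial(u, v)} = \sum_k \begin{vmatrix} \dfrac{\partial Q_k}{\partial u} & \dfrac{\partial P_k}{\partial u} \\[2mm] \dfrac{\partial Q_k}{\partial v} & \dfrac{\partial P_k}{\partial v} \end{vmatrix} = \sum_k \frac{\partial(Q_k, P_k)}{\partial(u, v)} . \tag{20}$$

Thus, together with (12), Poincaré's statement is proved.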
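b) Since we have already verified Poincaré's statement in part a), the
statement in (4) holds:

$$\sum_i \frac{\partial(q_i, p_i)}{\partial(u, v)} = \sum_i \frac{\partial(Q_i, P_i)}{\partial(u, v)} . \tag{21}$$

It is identical with the just proved relation (20) for the Jacobi determinants.
Indeed, rewriting this expression as

$$\sum_i\left(\frac{\partial q_i}{\partial u}\frac{\partial p_i}{\partial v} - \frac{\partial p_i}{\partial u}\frac{\partial q_i}{\partial v}\right) = \sum_i\left(\frac{\partial Q_i}{\partial u}\frac{\partial P_i}{\partial v} - \frac{\partial P_i}{\partial u}\frac{\partial Q_i}{\partial v}\right) , \tag{22}$$

we see that it is equivalent to the invariance of the Lagrange bracket:

$$\{\{u, v\}\}_{q,p} = \{\{u, v\}\}_{Q,P} . \tag{23}$$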
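c) First we show a useful relation. Let $u_l$, $l = 1, 2, \ldots, 2n$ be a set of 2n
independent functions such that every $u_l$ is a function of the 2n coordinates
$q_1, \ldots, q_n, p_1, \ldots, p_n$. Then the relation

$$\sum_{l=1}^{2n} \{\{u_l, u_i\}\}\{u_l, u_j\} = \delta_{ij} \tag{24}$$

is always valid. According to the definition of the Lagrange and Poisson
brackets, it follows that

$$\sum_{l=1}^{2n} \{\{u_l, u_i\}\}\{u_l, u_j\} = \sum_{l=1}^{2n}\sum_{k,m}^{n}\left(\frac{\partial q_k}{\partial u_l}\frac{\partial p_k}{\partial u_i} - \frac{\partial p_k}{\partial u_l}\frac{\partial q_k}{\partial u_i}\right)\left(\frac{\partial u_l}{\partial q_m}\frac{\partial u_j}{\partial p_m} - \frac{\partial u_l}{\partial p_m}\frac{\partial u_j}{\partial q_m}\right) . \tag{25}$$

The first term can be transformed to

$$\sum_{k,m}^{n}\frac{\partial p_k}{\partial u_i}\frac{\partial u_j}{\partial p_m}\left(\sum_{l}^{2n}\frac{\partial q_k}{\partial u_l}\frac{\partial u_l}{\partial q_m}\right) = \sum_{k,m}^{n}\frac{\partial p_k}{\partial u_i}\frac{\partial u_j}{\partial p_m}\,\delta_{km} = \sum_{k}^{n}\frac{\partial u_j}{\partial p_k}\frac{\partial p_k}{\partial u_i} . \tag{26}$$

The last term in (25) can be evaluated in the same way:

$$\sum_{k,m}^{n}\frac{\partial q_k}{\partial u_i}\frac{\partial u_j}{\partial q_m}\left(\sum_{l}^{2n}\frac{\partial p_k}{\partial u_l}\frac{\partial u_l}{\partial p_m}\right) = \sum_{k}^{n}\frac{\partial u_j}{\partial q_k}\frac{\partial q_k}{\partial u_i} , \tag{27}$$

so that the sum of the terms in (26) and (27) yields a total differential in $u_j$:

$$\sum_{k}^{n}\left(\frac{\partial u_j}{\partial p_k}\frac{\partial p_k}{\partial u_i} + \frac{\partial u_j}{\partial q_k}\frac{\partial q_k}{\partial u_i}\right) = \frac{\partial u_j}{\partial u_i} = \delta_{ij} . \tag{28}$$

The second and the third term of (25) always vanish; we show this as an example
for the second term:

$$-\sum_{k,m}^{n}\frac{\partial q_k}{\partial u_i}\frac{\partial u_j}{\partial p_m}\left(\sum_{l}^{2n}\frac{\partial p_k}{\partial u_l}\frac{\partial u_l}{\partial q_m}\right) = 0 , \tag{29}$$

because

$$\sum_{l}^{2n}\frac{\partial p_k}{\partial u_l}\frac{\partial u_l}{\partial q_m} = \frac{\partial p_k}{\partial q_m} = 0 . \tag{30}$$

Thus we have shown that

$$\sum_{l}^{2n} \{\{u_l, u_i\}\}\{u_l, u_j\} = \frac{\partial u_j}{\partial u_i} = \delta_{ij} . \tag{31}$$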

We note that up to now, the choice of the coordinate system has been irrelevant.
Hence, (31) is valid for all coordinate transformations, not only for canonical
ones. The latter quality serves to evaluate several Poisson brackets, without
committing ourselves to a certain system of coordinates.

For the 2n independent functions $u_l$, we choose the set $q_1, \ldots, q_n, p_1, \ldots, p_n$
and consider in particular the case $u_i = q_i$, $u_j = p_j$.

Thus (24) or (31) (which are the same) yields

$$\sum_l \{\{p_l, q_i\}\}\{p_l, p_j\} + \sum_l \{\{q_l, q_i\}\}\{q_l, p_j\} = 0 , \tag{32}$$

because $\partial u_j/\partial u_i = \partial p_j/\partial q_i = 0$ for all i, j. As was shown in part b), the Lagrange
bracket is invariant under canonical transformations. We will use this property
in order to show the invariance of the Poisson brackets appearing in (32).
The following expressions are thus invariant:

$$\{\{p_l, q_i\}\} = -\delta_{li} \quad\text{and}\quad \{\{q_l, q_i\}\} = 0 . \tag{33}$$

Inserting these into (32) the second term vanishes and we are left with:

$$\{p_i, p_j\} = 0 . \tag{34}$$

Since (32) is valid for all transformations, the same applies for the Poisson
bracket above.
Choosing $u_i = p_i$ and $u_j = q_j$ in (31) we obtain

$$\{q_i, q_j\} = 0 , \tag{35}$$

which is again valid for all transformations. Choosing $u_i = q_i$, $u_j = q_j$ in (31)
yields

$$\sum_l \{\{q_l, q_i\}\}\{q_l, q_j\} + \sum_l \{\{p_l, q_i\}\}\{p_l, q_j\} = \delta_{ij} . \tag{36}$$

Since the first term vanishes, the second expression has to satisfy

$$-\sum_l \delta_{li}\{p_l, q_j\} = \delta_{ij} , \tag{37}$$

which is only possible if

$$\{q_i, p_j\} = \delta_{ij} . \tag{38}$$

Thus the invariance of the fundamental Poisson bracket under arbitrary
canonical transformations is shown by use of the invariance properties of the

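Lagrange bracket. In particular
$\{q_i, p_j\} = \{Q_i, P_j\} = \delta_{ij}$, i.e.

$$\sum_k\left(\frac{\partial Q_i}{\partial q_k}\frac{\partial P_j}{\partial p_k} - \frac{\partial Q_i}{\partial p_k}\frac{\partial P_j}{\partial q_k}\right) = \delta_{ij} . \tag{39}$$
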

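d) For two arbitrary functions F and G, the Poisson bracket is defined with
respect to the set q, p in the following way:

$$\{F, G\}_{q,p} = \sum_{i=1}^{n}\left(\frac{\partial F}{\partial q_i}\frac{\partial G}{\partial p_i} - \frac{\partial F}{\partial p_i}\frac{\partial G}{\partial q_i}\right) . \tag{40}$$

The $q_i$ and $p_i$ are functions of the new variables $Q_i$ and $P_i$, respectively, and vice
versa. Hence, the function G can also be expressed in terms of $Q_i$, $P_i$. This
possibility is now used by transforming (40) into

$$\{F, G\}_{q,p} = \sum_{i,k}\left[\frac{\partial F}{\partial q_i}\left(\frac{\partial G}{\partial Q_k}\frac{\partial Q_k}{\partial p_i} + \frac{\partial G}{\partial P_k}\frac{\partial P_k}{\partial p_i}\right) - \frac{\partial F}{\partial p_i}\left(\frac{\partial G}{\partial Q_k}\frac{\partial Q_k}{\partial q_i} + \frac{\partial G}{\partial P_k}\frac{\partial P_k}{\partial q_i}\right)\right] . \tag{41}$$

A clever reshuffling of several terms leads to:

$$\{F, G\}_{q,p} = \sum_k\left(\frac{\partial G}{\partial Q_k}\{F, Q_k\}_{q,p} + \frac{\partial G}{\partial P_k}\{F, P_k\}_{q,p}\right) . \tag{42}$$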
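Replacing F by $Q_i$ and G by F similarly yields

$$\{Q_i, F\}_{q,p} = \sum_k \frac{\partial F}{\partial Q_k}\{Q_i, Q_k\}_{q,p} + \sum_k \frac{\partial F}{\partial P_k}\{Q_i, P_k\}_{q,p} . \tag{43}$$

As we have only known Poisson brackets left [cf. a)], we can start to evaluate

$$\{Q_i, F\}_{q,p} = \sum_k \frac{\partial F}{\partial P_k}\,\delta_{ik} = \frac{\partial F}{\partial P_i} . \tag{44}$$

Here the analogous relations to (35) and (38) for Q, P are used (the invariance of
the fundamental Poisson brackets!). Substituting $P_i$ for F and F for G, on the
other hand, yields

$$\{P_i, F\}_{q,p} = \sum_k \frac{\partial F}{\partial Q_k}\{P_i, Q_k\}_{q,p} + \sum_k \frac{\partial F}{\partial P_k}\{P_i, P_k\}_{q,p} ,$$

thus

$$\{P_i, F\}_{q,p} = -\frac{\partial F}{\partial Q_i} . \tag{45}$$

Substitution of the results (44) and (45) into (42) finally gives

$$\{F, G\}_{q,p} = \sum_k\left(\frac{\partial F}{\partial Q_k}\frac{\partial G}{\partial P_k} - \frac{\partial F}{\partial P_k}\frac{\partial G}{\partial Q_k}\right) = \{F, G\}_{Q,P} . \tag{46}$$

Thus we have demonstrated the invariance of the general Poisson bracket under
canonical transformations.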


8.5 Biographical Notes 


EHRENFEST, Paul, Austrian physicist, *Wien 18.1.1880, †Leiden 25.9.1933. E. was
a professor in Leiden (Netherlands) from 1912 on. He contributed to atomic physics with
his hypothesis of adiabatic invariants (BR).


POISSON, Siméon Denis, French mathematician, *Pithiviers 21.6.1781, †Paris
25.4.1840. P. was a student of the École Polytechnique and he was employed there after
completing his studies, being a professor from 1802. P. was a member of the bureau of
lengths and of the Académie des Sciences. He was a French peer from 1837. P. worked in
many fields, e.g. general mechanics, heat conduction, potential theory, differential
equations and the calculus of probabilities.


POINCARÉ, Henri, French mathematician, *Nancy 29.4.1854, †Paris 17.7.1912. P.
studied at the École Polytechnique and became professor at Caen in 1879, later, at Paris.
He produced more than 30 books. At the turn of the century he was believed to be the
outstanding mathematician of his age. P.'s greatest contribution to mathematical physics
was a paper on the dynamics of the electron (1906) in which he obtained, independently of
Einstein, many of the results of special relativity. Einstein developed the theory from
elementary considerations about light signalling, whereas P.'s treatment was based on
the theory of electromagnetism and was thus restricted. P.'s writings on the philosophy of
science were as important as his contributions to mathematics. He became a member of
the Académie Française in 1908 (taken from Encyclopedia Britannica, 1960 edition).


LAGRANGE, Joseph Louis, French mathematician, *Torino 25.1.1736, †Paris
10.4.1813. L. came from a French-Italian family, and in 1755 became professor in Torino.
In 1766 he went to Berlin as the director of the mathematical-physical class of the
academy. In 1786, after the death of Frederick II, he went to Paris, where he considerably
supported the reform of the measuring system, and where he was professor at several
universities. His very extensive work contains a new foundation of variational calculus
(1760) and its application to dynamics, contributions to the three-body problem (1772),
the application of the theory of chain fractions to the solution of equations (1767),
number-theoretical problems, and an unsuccessful reduction of infinitesimal calculus
to algebra. With his "Mécanique analytique" (1788), L. became the initiator of analytic
mechanics. Important for function theory is his "Théorie des fonctions analytiques,
contenant les principes du calcul différentiel" (1789), and for algebra his "Traité de la
résolution des équations numériques de tous degrés" (1798).


9. Charged Particles in Magnetic Fields 


9.1 Coupling to the Electromagnetic Field 


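If a charged particle of charge e moves in an electromagnetic field, the Lorentz
force

$$\mathbf F = e\left(\mathbf E + \frac{\mathbf v}{c}\times\mathbf B\right) \tag{9.1}$$

acts on the particle. The electric and magnetic field strengths can be expressed
by the corresponding potentials $\mathbf A(\mathbf r, t)$ and $\phi(\mathbf r, t)$ according to

$$\mathbf E = -\nabla\phi - \frac{1}{c}\frac{\partial\mathbf A}{\partial t} , \quad \mathbf B = \nabla\times\mathbf A . \tag{9.2}$$

Here, $\mathbf A(\mathbf r, t)$ is the vector potential and $\phi(\mathbf r, t)$ the Coulomb potential. In classical
mechanics, this motion is described by the Hamiltonian function

$$H = \frac{1}{2m}\left(\mathbf p - \frac{e}{c}\mathbf A\right)^2 + e\phi , \tag{9.3}$$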


which will be shown in Exercise 9.1. This indicates the simplest way of coupling
the electromagnetic field to the motion of the particle. The momentum $\mathbf p$ is replaced by
the term $\mathbf p - (e/c)\mathbf A$. The substitution $\mathbf p \to \mathbf p - (e/c)\mathbf A$ is gauge invariant and is called
the minimal coupling. Hamilton's canonical momentum $\mathbf p$ is the sum of the kinetic
momentum $m\mathbf v$ and the term $(e/c)\mathbf A$, which is determined by the vector potential.
Thus

$$\mathbf p = m\mathbf v + \frac{e}{c}\mathbf A . \tag{9.4}$$

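The transition to quantum mechanics is obtained by replacing the canonical
momentum $\mathbf p$ by $(\hbar/i)\nabla$, according to the rules of quantization in the coordinate
representation (see Chap. 8). Thus we obtain the Hamiltonian

$$\hat H = \frac{1}{2m}\left(\frac{\hbar}{i}\nabla - \frac{e}{c}\mathbf A\right)^2 + e\phi . \tag{9.5}$$

Calculating the square, it should be noted that, in general, the gradient and
vector potentials do not commute. We get

$$\begin{aligned}
\hat H &= -\frac{\hbar^2}{2m}\nabla^2 - \frac{e\hbar}{2imc}\left(\nabla\cdot\mathbf A + \mathbf A\cdot\nabla\right) + \frac{e^2}{2mc^2}\mathbf A^2 + e\phi , \\
\hat H &= -\frac{\hbar^2}{2m}\Delta + \frac{ie\hbar}{mc}\mathbf A\cdot\nabla + \frac{ie\hbar}{2mc}(\nabla\cdot\mathbf A) + \frac{e^2}{2mc^2}\mathbf A^2 + e\phi .
\end{aligned} \tag{9.6}$$
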
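It is well known that the electromagnetic potentials $\mathbf A$ and $\phi$ are not unique, but
are gauge dependent. Particularly in the Coulomb gauge, it holds that $\nabla\cdot\mathbf A = 0$;
thus the third term vanishes. If we change the order of the terms and use, for the
sake of clarity, the momentum operator $\hat{\mathbf p}$, we obtain

$$\hat H = \frac{\hat{\mathbf p}^2}{2m} + e\phi - \frac{e}{mc}\mathbf A\cdot\hat{\mathbf p} + \frac{e^2}{2mc^2}\mathbf A^2 = \hat H_0 - \frac{e}{mc}\mathbf A\cdot\hat{\mathbf p} + \frac{e^2}{2mc^2}\mathbf A^2 . \tag{9.7}$$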

Here, the operator $\hat H_0$ represents the motion of the particle without a magnetic
field; the coupling of the motion of the particle to the magnetic field is given
by the product $\mathbf A\cdot\hat{\mathbf p}$. The third term depends only on the $\mathbf A$ field; for normal
field strengths of small magnitude, it can be dropped. If the vector potential $\mathbf A$
describes a plane electromagnetic wave, the coupling terms in (9.7) lead to
radiative transitions (emission and absorption). The states of the particle in an
electromagnetic field are given as solutions of the Schrödinger equation with the
Hamiltonian derived above in (9.5):

$$\left[\frac{\left(\hat{\mathbf p} - (e/c)\mathbf A\right)^2}{2m} + e\phi\right]\psi = i\hbar\frac{\partial\psi}{\partial t} . \tag{9.8}$$


We can check that Ehrenfest's Theorem is also valid for this Schrödinger
equation for which we shall now prove the gauge invariance. Gauge invariance
means that the solutions of the Schrödinger equation describe the same physical
states if we apply to the potentials the transformations

$$\mathbf A' = \mathbf A + \nabla f(\mathbf r, t) \quad\text{and}\quad \phi' = \phi - \frac{1}{c}\frac{\partial f}{\partial t}(\mathbf r, t) \tag{9.9}$$

with the arbitrary function $f(\mathbf r, t)$. Using the four-component relativistic notation
by introducing the four-vector $A_\mu$, these transformations read

$$A'_\mu = A_\mu + \frac{\partial f}{\partial x_\mu} , \quad\text{with}\quad A_\mu = \{\mathbf A, i\phi\} \quad\text{and}\quad \mu = 1, 2, 3, 4 , \tag{9.10}$$

where $x_1 = x$, $x_2 = y$, $x_3 = z$, $x_4 = ict$.
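If we denote the Hamiltonian with primed potentials by $\hat H'$, the corresponding
Schrödinger equation becomes

$$\hat H'\psi' = i\hbar\frac{\partial}{\partial t}\psi' . \tag{9.11}$$

Our statement now is that $\psi$ and $\psi'$ differ only by a phase factor. If so, the gauge
transformation does not change the physical quantities, because, during their
calculation, only products of the form $\psi^*\psi$ or matrix elements $\langle\psi|\ldots|\psi\rangle$ occur, in
which the phase cancels. We start with

$$\psi' = \psi\exp\left(\frac{ie}{\hbar c}f(\mathbf r, t)\right) \tag{9.12}$$

and insert this in (9.11), which thus becomes

$$\begin{aligned}
\left[\frac{1}{2m}\left(\hat{\mathbf p} - \frac{e}{c}\mathbf A - \frac{e}{c}\nabla f\right)^2 + e\phi - \frac{e}{c}\frac{\partial f}{\partial t}\right]\psi\exp\left(\frac{ie}{\hbar c}f\right)
&= i\hbar\frac{\partial}{\partial t}\left[\psi\exp\left(\frac{ie}{\hbar c}f\right)\right] \\
&= \exp\left(\frac{ie}{\hbar c}f\right)\left(i\hbar\frac{\partial\psi}{\partial t} - \frac{e}{c}\frac{\partial f}{\partial t}\psi\right) .
\end{aligned} \tag{9.13}$$

We can easily see that

$$\begin{aligned}
\left(\hat{\mathbf p} - \frac{e}{c}\mathbf A'\right)\psi' &= \left(\frac{\hbar}{i}\nabla - \frac{e}{c}\mathbf A - \frac{e}{c}\nabla f\right)\psi\exp\left(\frac{ie}{\hbar c}f\right) \\
&= \exp\left(\frac{ie}{\hbar c}f\right)\left(\frac{\hbar}{i}\nabla + \frac{e}{c}\nabla f - \frac{e}{c}\mathbf A - \frac{e}{c}\nabla f\right)\psi \\
&= \exp\left(\frac{ie}{\hbar c}f\right)\left(\frac{\hbar}{i}\nabla - \frac{e}{c}\mathbf A\right)\psi .
\end{aligned} \tag{9.14}$$

Applying the operator $[\hat{\mathbf p} - (e/c)\mathbf A']$ once again, we obtain the equation

$$\hat H\psi = i\hbar\frac{\partial\psi}{\partial t} . \tag{9.15}$$
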


In other words, (9.15) follows from (9.11) by using (9.12). This outcome shows
us that the solutions of the Schrödinger equation (9.8) still describe the same
physical states, even after gauge transformation. The states $\psi$ and $\psi'$ differ only
by a unique (i.e. state-independent) phase factor $\exp[(ie/\hbar c)f(\mathbf r, t)]$. The physical
observables are not affected by this, as mentioned above. It is clear that it is not
the canonical momentum $\mathbf p \to -i\hbar\nabla$ (the expectation value of which is
not gauge invariant), but the genuine kinetic momentum $m\mathbf v \to -i\hbar\nabla - (e/c)\mathbf A$
(which is gauge invariant), that represents a measurable quantity.
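
The covariance of the kinetic momentum under a gauge transformation can be checked numerically in one dimension. The sketch below is an illustration only, not part of the original text. It assumes units with ħ = c = e = 1 and arbitrarily chosen functions A(x) = x², f(x) = sin x and ψ(x) = exp(-x² + ix/2); all function names are hypothetical. It verifies that the kinetic momentum applied to ψ′ equals the phase factor times the kinetic momentum applied to ψ, while the canonical momentum alone does not have this property.

```python
import cmath
import math

HBAR = C = E_CHARGE = 1.0  # assumption: units with hbar = c = e = 1

def A(x):
    """Arbitrary example vector potential (one component)."""
    return x * x

def f(x):
    """Arbitrary example gauge function."""
    return math.sin(x)

def grad_f(x):
    return math.cos(x)

def psi(x):
    """Arbitrary example wave function."""
    return cmath.exp(-x * x + 0.5j * x)

def A_prime(x):
    """Gauge-transformed potential A' = A + df/dx."""
    return A(x) + grad_f(x)

def phase(x):
    """Phase factor exp(i e f / (hbar c))."""
    return cmath.exp(1j * E_CHARGE * f(x) / (HBAR * C))

def psi_prime(x):
    return psi(x) * phase(x)

def ddx(func, x, h=1e-5):
    """Central first derivative."""
    return (func(x + h) - func(x - h)) / (2 * h)

def kinetic_momentum(wave, potential, x):
    """Apply (hbar/i) d/dx - (e/c) A to a wave function at the point x."""
    return (HBAR / 1j) * ddx(wave, x) - (E_CHARGE / C) * potential(x) * wave(x)

x0 = 0.3
lhs = kinetic_momentum(psi_prime, A_prime, x0)
rhs = phase(x0) * kinetic_momentum(psi, A, x0)
canonical_mismatch = abs((HBAR / 1j) * ddx(psi_prime, x0)
                         - phase(x0) * (HBAR / 1j) * ddx(psi, x0))
print(abs(lhs - rhs), canonical_mismatch)
```

The first printed number is zero up to finite-difference error. The second is of order one: the canonical momentum picks up the extra term (e/c) ∂f/∂x, which is exactly what the shift of A compensates.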

Hence, if in a physical problem the momentum operator p appears, the 
operator p must always be replaced by p — (e/c)A if electromagnetic fields are 
present. This is the only way to guarantee gauge invariance in quantum theory; 
otherwise, certain potentials $\mathbf A$ and $\phi$ could be determined in quantum
mechanics, and this should not be possible!

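We shall now summarize once more the principal idea of gauge invariance in
quantum mechanics, in relativistic notation. The gauge transformation for the
electromagnetic fields $A_\mu(x_\nu)$ is

$$A'_\mu = A_\mu + \frac{\partial f}{\partial x_\mu} , \quad\text{with}\quad A_\mu = \{\mathbf A, i\phi\} \quad\text{and}\quad x_\mu = \{\mathbf x, ict\} . \tag{9.16}$$

This leaves the electromagnetic observables, i.e. the field strengths $\mathbf E$ and $\mathbf B$,
unchanged. The four-momentum operator is given by

$$\hat p_\mu = \frac{\hbar}{i}\frac{\partial}{\partial x_\mu} = \left\{\frac{\hbar}{i}\nabla, -\frac{\hbar}{c}\frac{\partial}{\partial t}\right\} = \left\{\hat{\mathbf p}, \frac{i}{c}\hat E\right\} , \quad \hat E = i\hbar\frac{\partial}{\partial t} , \tag{9.17}$$

and minimal coupling is achieved through the replacement

$$\hat p_\mu \to \hat p_\mu - \frac{e}{c}A_\mu . \tag{9.18}$$

In quantum mechanics, the gauge transformation (9.16) must be supplemented
by the phase transformation of the wave function

$$\psi'(x_\mu) = \psi(x_\mu)\exp\left(\frac{ie}{\hbar c}f(x_\mu)\right) , \tag{9.19}$$

so that

$$\begin{aligned}
\left(\hat p_\mu - \frac{e}{c}A'_\mu\right)\psi'(x_\nu) &= \left(\hat p_\mu - \frac{e}{c}A_\mu - \frac{e}{c}\frac{\partial f}{\partial x_\mu}\right)\exp\left(\frac{ie}{\hbar c}f\right)\psi(x_\nu) \\
&= \exp\left(\frac{ie}{\hbar c}f\right)\left(\hat p_\mu - \frac{e}{c}A_\mu\right)\psi(x_\nu)
\end{aligned} \tag{9.20}$$

holds. Then we can be certain that observables of the type

$$\begin{aligned}
\langle\psi'(x_\nu)|V(x_\nu)|\psi'(x_\nu)\rangle &= \langle\psi(x_\nu)|V(x_\nu)|\psi(x_\nu)\rangle \quad\text{and} \\
\left\langle\psi'(x_\nu)\left|F\left(\hat p_\mu - \frac{e}{c}A'_\mu\right)\right|\psi'(x_\nu)\right\rangle &= \left\langle\psi(x_\nu)\left|F\left(\hat p_\mu - \frac{e}{c}A_\mu\right)\right|\psi(x_\nu)\right\rangle
\end{aligned} \tag{9.21}$$

are unchanged by (i.e. are invariant under) gauge transformations. The
equations in (9.20) are exactly the right-hand side of the former equations (9.13) for
$\mu = 4$ and (9.14) for $\mu = 1, 2, 3$, respectively. The following examples and
exercises will further clarify this discussion.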


EXERCISE


9.1 The Hamilton Equations in an Electromagnetic Field 


Let $q_1, q_2, \ldots, q_s, \ldots, q_N$ be the generalized position coordinates determining the
configuration of the system, and $p_1, p_2, \ldots, p_s, \ldots, p_N$ the canonical conjugated
momenta. The Hamiltonian (in classical mechanics we prefer to call it Hamiltonian
function) H is a function of those position coordinates and momenta,
and, in general, of time t.

Hamilton's equations are, as we know,

$$\frac{dp_s}{dt} = -\frac{\partial H}{\partial q_s} , \quad \frac{dq_s}{dt} = \frac{\partial H}{\partial p_s} . \tag{1}$$
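The derivative of any function $F(q_i, p_i, t)$ of the generalized coordinates,
momenta and time, with respect to time, is

$$\frac{dF}{dt} = \frac{\partial F}{\partial t} + \sum_{s=1}^{N}\frac{\partial F}{\partial q_s}\frac{dq_s}{dt} + \sum_{s=1}^{N}\frac{\partial F}{\partial p_s}\frac{dp_s}{dt} . \tag{2}$$

Using Hamilton's equation (1) we can transform (2) into the following form:

$$\frac{dF}{dt} = \frac{\partial F}{\partial t} + \{H, F\} , \tag{3}$$

where $\{H, F\}$ is equal to

$$\{H, F\} = \sum_{s=1}^{N}\left(\frac{\partial F}{\partial q_s}\frac{\partial H}{\partial p_s} - \frac{\partial H}{\partial q_s}\frac{\partial F}{\partial p_s}\right) \tag{4}$$

and is the so-called Poisson bracket [cf. (8.14)].

Obviously, Hamilton's equations (1) can now be written

$$\frac{dp_s}{dt} = \{H, p_s\} , \quad \frac{dq_s}{dt} = \{H, q_s\} , \quad s = 1, 2, \ldots, N \tag{5}$$

[we have only to set $F = p_s$ and $F = q_s$ in (3)].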

In Chap. 8 we learned that in quantum mechanics, equations of motion are
written in an analogous way. In the special case of a Cartesian system and of
a particle in a field derivable from a potential function V(x, y, z, t), we have

$$H = \frac{p_x^2 + p_y^2 + p_z^2}{2\mu} + V(x, y, z, t) , \tag{6}$$

where $q_1 = x$, $q_2 = y$, $q_3 = z$, and $p_1 = p_x$, $p_2 = p_y$, and $p_3 = p_z$. With (5) we
obtain

$$\frac{dp_x}{dt} = \{H, p_x\} = -\frac{\partial H}{\partial x} = -\frac{\partial V}{\partial x} , \quad \frac{dx}{dt} = \{H, x\} = \frac{\partial H}{\partial p_x} = \frac{p_x}{\mu} . \tag{7}$$

The equations of the other coordinates and momenta can be obtained in the
same way. From (7) we get

$$\mu\frac{d^2x}{dt^2} = -\frac{\partial V}{\partial x} , \tag{8}$$

i.e. Newton's equation of motion.

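Consider now the motion of a charged particle with charge e and mass $\mu$ in
an electromagnetic field described by a potential $\phi = (1/e)V(\mathbf x, t)$ and a vector
potential $\mathbf A$, so that

$$\mathbf E = -\nabla\phi - \frac{1}{c}\frac{\partial\mathbf A}{\partial t} , \tag{9}$$

$$\mathbf B = \operatorname{curl}\mathbf A , \tag{10}$$

where $\mathbf E$ and $\mathbf B$ are the electric and magnetic fields. In this case, the Hamiltonian
function can be written as

$$H = \frac{1}{2\mu}\left(\mathbf p - \frac{e}{c}\mathbf A\right)^2 + e\phi . \tag{11}$$

Indeed, we will show that the Hamilton equations that emerge from this
function,

$$\frac{dp_x}{dt} = -\frac{\partial H}{\partial x} , \quad \frac{dp_y}{dt} = -\frac{\partial H}{\partial y} , \quad \frac{dp_z}{dt} = -\frac{\partial H}{\partial z} , \tag{12}$$

$$\frac{dx}{dt} = \frac{\partial H}{\partial p_x} , \quad \frac{dy}{dt} = \frac{\partial H}{\partial p_y} , \quad \frac{dz}{dt} = \frac{\partial H}{\partial p_z} , \tag{13}$$

are equivalent to Newton's equations for the same particle under influence of the
Lorentz force:

$$\mu\frac{d^2x}{dt^2} = eE_x + \frac{e}{c}\left(\frac{dy}{dt}B_z - \frac{dz}{dt}B_y\right) , \quad\text{and cyclically for } y \text{ and } z . \tag{14}$$

Inserting H from (11) into (12) and (13), we can write after derivation:

$$\frac{dp_x}{dt} = \frac{e}{\mu c}\left[\left(p_x - \frac{e}{c}A_x\right)\frac{\partial A_x}{\partial x} + \left(p_y - \frac{e}{c}A_y\right)\frac{\partial A_y}{\partial x} + \left(p_z - \frac{e}{c}A_z\right)\frac{\partial A_z}{\partial x}\right] - e\frac{\partial\phi}{\partial x} , \tag{15}$$

$$\frac{dx}{dt} = \frac{1}{\mu}\left(p_x - \frac{e}{c}A_x\right) . \tag{16}$$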

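This implies that

$$\frac{dp_x}{dt} = \mu\frac{d^2x}{dt^2} + \frac{e}{c}\frac{dA_x}{dt} . \tag{17}$$

Thus (15) can now be written in the following form:

$$\mu\frac{d^2x}{dt^2} + \frac{e}{c}\frac{dA_x}{dt} = \frac{e}{c}\left(\frac{dx}{dt}\frac{\partial A_x}{\partial x} + \frac{dy}{dt}\frac{\partial A_y}{\partial x} + \frac{dz}{dt}\frac{\partial A_z}{\partial x}\right) - e\frac{\partial\phi}{\partial x} . \tag{18}$$

Since the value of the vector potential $\mathbf A$ is obtained at the position of the charge
e, the total derivative of $A_x$ with respect to time is

$$\frac{dA_x}{dt} = \frac{\partial A_x}{\partial t} + \frac{\partial A_x}{\partial x}\frac{dx}{dt} + \frac{\partial A_x}{\partial y}\frac{dy}{dt} + \frac{\partial A_x}{\partial z}\frac{dz}{dt} . \tag{19}$$

After inserting into (15) and (16) the values of $[p_x - (e/c)A_x]$, $[p_y - (e/c)A_y]$ and
$[p_z - (e/c)A_z]$ and of $dp_x/dt$ from (17), we find with the help of (19) that

$$\mu\frac{d^2x}{dt^2} = -\frac{e}{c}\frac{\partial A_x}{\partial t} - e\frac{\partial\phi}{\partial x} + \frac{e}{c}\left[\frac{dy}{dt}\left(\frac{\partial A_y}{\partial x} - \frac{\partial A_x}{\partial y}\right) + \frac{dz}{dt}\left(\frac{\partial A_z}{\partial x} - \frac{\partial A_x}{\partial z}\right)\right] . \tag{20}$$

Here we can use the formulae (9) and (10), which connect fields and potential,
to get

$$\mu\frac{d^2x}{dt^2} = eE_x + \frac{e}{c}\left(\frac{dy}{dt}B_z - \frac{dz}{dt}B_y\right) . \tag{21}$$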

This is the first of the equations in (14); we can derive the other two relations in
the same way. We thus see that the Hamilton equations (12) and (13), resulting
from the Hamiltonian function (11), are equivalent to Newton's equations (14).
The potentials $\mathbf A$ and $\phi$ can be chosen at will, if only formulae (9) and (10) lead to
the required electromagnetic field. Using $\mathbf A'$ and $\phi'$ instead of $\mathbf A$ and $\phi$, where

$$\mathbf A' = \mathbf A + \nabla f , \quad \phi' = \phi - \frac{1}{c}\frac{\partial f}{\partial t} , \tag{22}$$

and f is an arbitrary function of the position coordinates and time, we get
$\mathbf E' = \mathbf E$ and $\mathbf B' = \mathbf B$. When replacing $\mathbf A$ and $\phi$ in the Hamiltonian function (11) by
$\mathbf A'$ and $\phi'$, we get the equation of motion (20), with $\mathbf A$ and $\phi$ replaced by $\mathbf A'$ and
$\phi'$, i.e. with the same equations (14). Thus, using (22), we have shown that
equations (14) are independent of the choice of the potentials. This property of
Hamilton's equations is known as gauge invariance.
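
The equivalence of Hamilton's equations and the Lorentz-force law can be illustrated by integrating Hamilton's equations numerically for a uniform magnetic field. The following is a minimal sketch, not part of the original text. It assumes units with mass, charge, c and B all equal to 1, φ = 0, the symmetric gauge A = (B/2)(-y, x, 0), motion in the x-y plane, and a hand-written fourth-order Runge-Kutta step; all names are hypothetical. Newton's equation with the Lorentz force predicts a circle of radius 1 traversed with the cyclotron period 2π, so the particle should return to its starting point after one period.

```python
import math

M = C = E_CHARGE = B = 1.0  # assumption: mass, c, charge and field strength set to 1

def vector_potential(x, y):
    """Symmetric gauge A = (B/2) * (-y, x) for a uniform field along z."""
    return (-0.5 * B * y, 0.5 * B * x)

def derivatives(state):
    """Hamilton's equations for H = (p - (e/c) A)^2 / (2m), with phi = 0."""
    x, y, px, py = state
    ax, ay = vector_potential(x, y)
    vx = (px - E_CHARGE / C * ax) / M   # dx/dt = dH/dp_x
    vy = (py - E_CHARGE / C * ay) / M   # dy/dt = dH/dp_y
    # dp/dt = -dH/dq, using dA_y/dx = +B/2 and dA_x/dy = -B/2
    dpx = (E_CHARGE / C) * vy * (0.5 * B)
    dpy = (E_CHARGE / C) * vx * (-0.5 * B)
    return (vx, vy, dpx, dpy)

def rk4_step(state, dt):
    """One classical fourth-order Runge-Kutta step."""
    def add(s, k, factor):
        return tuple(si + factor * ki for si, ki in zip(s, k))
    k1 = derivatives(state)
    k2 = derivatives(add(state, k1, dt / 2))
    k3 = derivatives(add(state, k2, dt / 2))
    k4 = derivatives(add(state, k3, dt))
    return tuple(s + dt / 6 * (a + 2 * b + 2 * c_ + d)
                 for s, a, b, c_, d in zip(state, k1, k2, k3, k4))

# Start at (1, 0) with velocity (0, -1): canonical momentum p = m v + (e/c) A = (0, -0.5)
state = (1.0, 0.0, 0.0, -0.5)
period = 2 * math.pi * M * C / (E_CHARGE * B)
steps = 2000
for _ in range(steps):
    state = rk4_step(state, period / steps)

final_x, final_y = state[0], state[1]
ax_end, ay_end = vector_potential(final_x, final_y)
final_speed = math.hypot(state[2] - ax_end, state[3] - ay_end)
print(final_x, final_y, final_speed)
```

After one period the position is back at (1, 0) and the speed is still 1, as the Lorentz-force equation requires. The canonical momentum (0, -0.5) differs from the kinetic momentum m v = (0, -1), which reflects the relation p = m v + (e/c) A.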

Note that the Hamiltonian function H is changed by the transformation (22)
in contrast to the equations (14). For instance, the motion in a homogeneous
constant electric field oriented along the x axis can be described by the
potentials $\mathbf A = 0$ and $\phi = -Ex$ as well as, for example, $\mathbf A' = (-cEt, 0, 0)$ and $\phi' = 0$,
according to (22). It can easily be verified that both choices lead to Newton's
equation of uniformly accelerated motion, but in the first case, the Hamiltonian
function represents the total energy of the particle, and in the second, it
represents the kinetic energy.
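
The verification mentioned above can be written out explicitly. The following short derivation is a sketch based on the Hamiltonian function (11) and Hamilton's equations (12) and (13); the gauge function $f = -cEtx$ connecting the two choices is inferred here from (22) and is not stated in the original text.

```latex
% Gauge 1: A = 0, phi = -E x
H = \frac{\mathbf p^2}{2\mu} - eEx , \qquad
\dot x = \frac{p_x}{\mu} , \quad \dot p_x = eE
\;\Rightarrow\; \mu\ddot x = eE .

% Gauge 2: A' = (-cEt, 0, 0), phi' = 0, obtained from (22) with f = -cEtx
H' = \frac{1}{2\mu}\left[(p_x + eEt)^2 + p_y^2 + p_z^2\right] , \qquad
\dot x = \frac{p_x + eEt}{\mu} , \quad \dot p_x = 0
\;\Rightarrow\; \mu\ddot x = eE .
```

In the second gauge $H' = \tfrac{1}{2}\mu\mathbf v^2$ is the kinetic energy alone, and the canonical momentum $p_x$ is conserved although the velocity grows linearly in time.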
9.2 The Lagrangian and Hamiltonian of a Charged Particle


Problem. Determine the Lagrangian and the Hamiltonian of a charged particle 
in an electromagnetic field. Use vector calculus as far as possible. 


Solution. The effect of the electromagnetic field on a charged particle can be 
described by a velocity-dependent generalized potential. 

Starting with the Lorentz force, we determine this potential, and the Lagran- 
gian and Hamiltonian functions. The Lorentz force takes the form 


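$$ \mathbf{F} = e\left(\mathbf{E} + \frac{1}{c}\mathbf{v}\times\mathbf{B}\right) . \tag{1} $$

We can express the electric and magnetic fields by the potentials 

$$ \mathbf{E} = -\nabla\phi - \frac{1}{c}\frac{\partial\mathbf{A}}{\partial t} , \qquad \mathbf{B} = \nabla\times\mathbf{A} . \tag{2} $$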


Insertion of (2) into the Lorentz force (1) yields 


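$$ \mathbf{F} = e\left[-\nabla\phi - \frac{1}{c}\frac{\partial\mathbf{A}}{\partial t} + \frac{1}{c}\mathbf{v}\times(\nabla\times\mathbf{A})\right] . \tag{3} $$

We can use the relation 

$$ \mathbf{B}\times(\nabla\times\mathbf{C}) = \nabla(\mathbf{B}\cdot\mathbf{C}) - (\mathbf{B}\cdot\nabla)\mathbf{C} - (\mathbf{C}\cdot\nabla)\mathbf{B} - \mathbf{C}\times(\nabla\times\mathbf{B}) $$

to transform the triple vector product 

$$ \mathbf{v}\times(\nabla\times\mathbf{A}) = \nabla(\mathbf{v}\cdot\mathbf{A}) - (\mathbf{v}\cdot\nabla)\mathbf{A} , \tag{4} $$

since the velocity $\mathbf{v}$ is not an explicit function of the position. 
The total derivative of the vector potential with respect to time is given by 

$$ \frac{d\mathbf{A}}{dt} = \frac{\partial\mathbf{A}}{\partial t} + (\mathbf{v}\cdot\nabla)\mathbf{A} . \tag{5} $$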


The first term is the explicit change of the vector potential with time; the second 
term stems from the fact that the position at which the value of the potential is 
obtained changes because of the particle’s motion. 

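We now replace the vector product in (3) using the relations (4) and (5) and get 

$$ \mathbf{F} = e\left[-\nabla\phi + \frac{1}{c}\nabla(\mathbf{v}\cdot\mathbf{A}) - \frac{1}{c}\frac{d\mathbf{A}}{dt}\right] . \tag{6} $$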


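To deduce the generalized forces $Q_i$ from a velocity-dependent potential 
$U(q_i, \dot{q}_i)$, we rely on the Lagrangian formalism, where the relation 

$$ Q_i = -\frac{\partial U}{\partial q_i} + \frac{d}{dt}\frac{\partial U}{\partial\dot{q}_i} \tag{7} $$

is valid. For comparison with (7), we transform 

$$ \frac{d\mathbf{A}}{dt} = \frac{d}{dt}\nabla_v(\mathbf{A}\cdot\mathbf{v}) , \tag{8} $$

where $\nabla_v$ signifies the derivative (gradient) with respect to the three components 
of the velocity. We take, for instance, the x component and compare (6) and (7) 
using relation (8): 

$$ F_x = -\frac{\partial}{\partial x}\left(e\phi - \frac{e}{c}\mathbf{v}\cdot\mathbf{A}\right) + \frac{d}{dt}\frac{\partial}{\partial v_x}\left(e\phi - \frac{e}{c}\mathbf{v}\cdot\mathbf{A}\right) . \tag{9} $$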


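Since the electrostatic potential $\phi(\mathbf{r}, t)$ is independent of velocity, we were able to 
add it to the last term. Hence, we get the generalized potential, 

$$ U = e\phi - \frac{e}{c}\mathbf{v}\cdot\mathbf{A} . \tag{10} $$

Using $L = T - U$ yields the Lagrangian 

$$ L = \frac{1}{2}mv^2 - e\phi + \frac{e}{c}\mathbf{v}\cdot\mathbf{A} , \tag{11} $$

and, in the form of generalized coordinates, we have 

$$ L = \frac{1}{2}m\sum_{i=1}^{3}\dot{q}_i^2 - e\phi(q_i) + \frac{e}{c}\sum_{i=1}^{3}\dot{q}_i A_i . \tag{12} $$

The canonical momentum is given by 

$$ p_i = \frac{\partial L}{\partial\dot{q}_i} = m\dot{q}_i + \frac{e}{c}A_i \tag{13} $$

or, in vector form, 

$$ \mathbf{p} = m\mathbf{v} + \frac{e}{c}\mathbf{A} . \tag{14} $$

Now the Hamiltonian can be derived from the Lagrange function $L$ by 

$$ H = \mathbf{p}\cdot\mathbf{v} - L . \tag{15} $$

It has the form 

$$ H = \frac{1}{2m}\left(\mathbf{p} - \frac{e}{c}\mathbf{A}\right)^2 + e\phi , \tag{16} $$

where the velocity is replaced using (14). 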


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9.3 Landau States 


Problem. a) What is the Schrödinger equation for the motion of charged 
particles in a constant magnetic field $\mathbf{B} = B\mathbf{e}_z$? Choose the following vector 
potential: 

$$ \mathbf{A} = (-By, 0, 0) \quad \text{and} \quad \phi = 0 . $$

b) Show that the following separation of variables: 

$$ \psi(x, y, z) = e^{i(\alpha x + \beta z)}\varphi(y) , $$

used after substituting $y = y' - \hbar\alpha c/eB$, leads to the equation of a harmonic 
oscillator. 
c) What are the energy eigenvalues? 


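Solution. a) It can be easily verified that the chosen vector potential in fact leads 
to the magnetic field $\mathbf{B} = B\mathbf{e}_z$. 

As has already been shown [see (9.8)], the Schrödinger equation (in its 
stationary form) is 

$$ \frac{1}{2m}\left(\hat{\mathbf{p}} - \frac{e}{c}\mathbf{A}\right)^2\psi(\mathbf{r}) = E\psi(\mathbf{r}) . \tag{1} $$

Calculation of the product $[\hat{\mathbf{p}} - (e/c)\mathbf{A}]^2$ and insertion of $\mathbf{A}$ yields 

$$ \frac{1}{2m}\left[-\hbar^2\frac{\partial^2}{\partial x^2} - 2i\hbar\frac{e}{c}By\frac{\partial}{\partial x} + \frac{e^2}{c^2}B^2y^2 - \hbar^2\frac{\partial^2}{\partial y^2} - \hbar^2\frac{\partial^2}{\partial z^2}\right]\psi = E\psi \tag{2} $$

or 

$$ \left(-\frac{\hbar^2}{2m}\Delta - \frac{i\hbar eB}{mc}y\frac{\partial}{\partial x} + \frac{e^2B^2}{2mc^2}y^2\right)\psi = E\psi . \tag{3} $$

b) The expression $\psi(x, y, z) = \exp(i\alpha x + i\beta z)\varphi(y)$ with two constants $\alpha$ and 
$\beta$ leads to 

$$ \left[\frac{\hbar^2}{2m}\alpha^2 + \frac{\hbar^2}{2m}\beta^2 - \frac{\hbar^2}{2m}\frac{\partial^2}{\partial y^2} + \frac{\hbar eB\alpha}{mc}y + \frac{e^2B^2}{2mc^2}y^2\right] e^{i(\alpha x + \beta z)}\varphi(y) = E\, e^{i(\alpha x + \beta z)}\varphi(y) \tag{4} $$

and thus 

$$ \left(-\frac{\hbar^2}{2m}\frac{d^2}{dy^2} + \frac{\hbar eB\alpha}{mc}y + \frac{e^2B^2}{2mc^2}y^2\right)\varphi(y) = \left(E - \frac{\hbar^2}{2m}\alpha^2 - \frac{\hbar^2}{2m}\beta^2\right)\varphi(y) . \tag{5} $$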


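That $\psi \propto \exp(i\alpha x + i\beta z)\varphi(y)$ seems to imply that the particle is free to move in 
the x and z directions ($\perp\mathbf{B}$ and $\parallel\mathbf{B}$), and that this motion is related to the kinetic 
energies $(\hbar^2/2m)\alpha^2$ and $(\hbar^2/2m)\beta^2$, respectively. 
We will soon come back to this point. Now, substituting 

$$ y' = y + \frac{\hbar c\alpha}{eB} = y + \frac{\hbar\alpha}{m\omega_0} \tag{6} $$

and setting 

$$ \omega_0 = \frac{eB}{mc} \quad \text{and} \quad \varepsilon = E - \frac{\hbar^2}{2m}\beta^2 , \tag{7} $$

the Schrödinger equation becomes 

$$ \left[-\frac{\hbar^2}{2m}\frac{d^2}{dy'^2} + \hbar\omega_0\alpha\left(y' - \frac{\hbar\alpha}{m\omega_0}\right) + \frac{m}{2}\omega_0^2\left(y' - \frac{\hbar\alpha}{m\omega_0}\right)^2\right]\varphi(y') = \left(\varepsilon - \frac{\hbar^2}{2m}\alpha^2\right)\varphi(y') . \tag{8} $$

This can be simplified to 

$$ \left(-\frac{\hbar^2}{2m}\frac{d^2}{dy'^2} + \frac{m}{2}\omega_0^2 y'^2\right)\varphi(y') = \varepsilon\varphi(y') . \tag{9} $$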


Now we have, once again, the equation of a harmonic oscillator. Note that the 
"kinetic energy" in the x direction, $(\hbar^2/2m)\alpha^2$, has now been absorbed into the $y'$ 
degree of freedom. 

c) From the above we can immediately write down the energy eigenvalues, 
namely 

$$ \varepsilon_n = \hbar\omega_0\left(n + \tfrac{1}{2}\right) , \qquad n = 0, 1, 2, \ldots . \tag{10} $$

The functions $\varphi(y')$ are related to the Hermite polynomials and are located 
around 

$$ y' = 0 , \quad \text{i.e.} \quad y_0 = -(\hbar c/eB)\alpha . $$

The total energy is 

$$ E_n(\beta) = \hbar\omega_0\left(n + \tfrac{1}{2}\right) + \frac{\hbar^2}{2m}\beta^2 . \tag{11} $$

Neglecting the motion in the z direction ($\beta = 0$), the energy $E_n(0)$ is quantized. 
For a given $\alpha$, the wave function 

$$ \psi(x, y, z) = \exp(i\alpha x + i\beta z)\varphi(y) $$

is localized in the y direction, but not in the x direction. This result is unexpected, 
for both directions should be equally represented. However, as we have 
seen above, the energy is independent of $\alpha$, so that we have infinite degeneracy. 
Thus wave packets of the form 

$$ \psi(x, y, z) = \int_{-\infty}^{+\infty} c(\alpha)\, e^{i(\alpha x + \beta z)}\varphi(y)\, d\alpha , \tag{12} $$

where $\varphi$ depends on $\alpha$ through $y' = y + \hbar c\alpha/eB$ and $c(\alpha)$ can be (nearly) chosen at will, are also solutions of the Schrödinger 
equation (2). Therefore we can choose $c(\alpha)$ so that the solution is localized in the 
x direction, too. Such bound states in the x-y plane are unrestricted in the 
z direction, i.e. along the direction of the magnetic field B. They correspond 
classically to electrons orbiting perpendicular to B, but moving with constant 
velocity (momentum) along B and are called Landau states; the energy levels (11) 
are Landau levels. 
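
The Landau levels (11) can be evaluated numerically, as in the minimal sketch below. It assumes SI units, in which the cyclotron frequency reads $\omega_0 = eB/m$ (the text uses Gaussian units, $\omega_0 = eB/mc$). The constants are standard CODATA values and the function names are illustrative only; none of them come from the text.

```python
# Landau-level energies E_n(beta) = hbar*omega_0*(n + 1/2) + (hbar*beta)^2 / (2m).
# Assumption: SI units, so omega_0 = e*B/m (the text's Gaussian form is e*B/(m*c)).
# Constants are CODATA values, not taken from the text.

HBAR = 1.054571817e-34         # reduced Planck constant, J s
E_CHARGE = 1.602176634e-19     # elementary charge, C
M_ELECTRON = 9.1093837015e-31  # electron mass, kg


def cyclotron_frequency(b_tesla, mass=M_ELECTRON, charge=E_CHARGE):
    """Angular cyclotron frequency omega_0 in rad/s for a field given in tesla."""
    return charge * b_tesla / mass


def landau_energy_ev(n, b_tesla, beta=0.0, mass=M_ELECTRON):
    """Energy of the n-th Landau level in eV; beta is the wave number along B in 1/m."""
    if n < 0:
        raise ValueError("n must be a non-negative integer")
    omega0 = cyclotron_frequency(b_tesla, mass)
    transverse = HBAR * omega0 * (n + 0.5)            # quantized motion perpendicular to B
    longitudinal = (HBAR * beta) ** 2 / (2.0 * mass)  # free motion along B
    return (transverse + longitudinal) / E_CHARGE


if __name__ == "__main__":
    for n in range(3):
        print(n, landau_energy_ev(n, b_tesla=1.0))
```

For an electron in a field of 1 T the level spacing comes out at roughly $10^{-4}$ eV, and it does not depend on $\alpha$, which is the infinite degeneracy discussed above.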


9.2 The Hydrogen Atom 


The most important example of the motion of a particle in a potential field is the 
hydrogen atom. 

Electrons and protons attract each other with the force $e^2/r^2$, corresponding 
to the potential $-e^2/r$. Here, $r$ is the coordinate of relative motion, and is all 
that interests us for the moment. We choose the proton to be the centre of our 
coordinate system; the mass $m$ used in the following context is then the reduced 
mass of the electron: 

$$ m = \frac{m_e}{1 + m_e/m_p} \approx m_e\left(1 - \frac{1}{1836}\right) . \tag{9.22} $$


Since we have a central potential, we use spherical coordinates. The stationary 
Schrödinger equation is then 

$$ \hat{H}\psi = E\psi = \left(\frac{\hat{p}^2}{2m} - \frac{e^2}{r}\right)\psi . \tag{9.23} $$

The squared momentum operator 

$$ \hat{p}^2 = -\hbar^2\Delta = -\hbar^2\left(\frac{1}{r}\frac{\partial^2}{\partial r^2}r + \frac{1}{r^2}\Delta_{\vartheta,\varphi}\right) $$

can be divided with the aid of $\hat{L}^2 = -\hbar^2\Delta_{\vartheta,\varphi}$ into a radial part and a rotational 
part containing the angular-momentum operator $\hat{L}$ (see Example 4.9). Consequently, 
the Schrödinger equation takes the form 

$$ \left[\frac{1}{r}\frac{\partial^2}{\partial r^2}r - \frac{\hat{L}^2}{\hbar^2r^2} + \frac{2m}{\hbar^2}\left(E + \frac{e^2}{r}\right)\right]\psi = 0 . \tag{9.24} $$

In the Schrödinger equation a centrifugal term $-\hat{L}^2/2mr^2$ appears, similar to 
that in Kepler's problem in classical mechanics. 
With the following separation of variables: 

$$ \psi(r, \vartheta, \varphi) = \frac{R(r)}{r}Y(\vartheta, \varphi) \tag{9.25} $$
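it is possible to separate (9.24) into a radial and an angular part. We start with 

$$ \frac{1}{r}\frac{\partial^2}{\partial r^2}\left(r\frac{R(r)}{r}\right) = \frac{1}{r}\frac{\partial^2R(r)}{\partial r^2} \tag{9.26} $$

and introduce the separation constant $l(l+1)\hbar^2$ to get 

$$ \frac{r^2}{R(r)}\left[\frac{\partial^2R(r)}{\partial r^2} + \frac{2m}{\hbar^2}\left(E + \frac{e^2}{r}\right)R(r)\right] = \frac{1}{\hbar^2Y(\vartheta,\varphi)}\hat{L}^2Y(\vartheta,\varphi) = l(l+1) . \tag{9.27} $$

Hence, we have the two equations 

$$ \frac{d^2R_l}{dr^2} + \left[\frac{2m}{\hbar^2}\left(E + \frac{e^2}{r}\right) - \frac{l(l+1)}{r^2}\right]R_l = 0 \quad \text{and} \tag{9.28} $$

$$ \hat{L}^2Y_{lm}(\vartheta, \varphi) = \hbar^2l(l+1)Y_{lm}(\vartheta, \varphi) , \quad \text{with} \quad l = 0, 1, 2, \ldots \quad \text{and} \quad -l \le m \le l . \tag{9.29} $$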


The solutions of the angular differential equation (9.29) are the already familiar 
spherical harmonics $Y_{lm}(\vartheta, \varphi)$ (see Examples 4.8-9). The separation constant is 
the quantum number of the square of the angular momentum, $L^2 = \hbar^2l(l+1)$. 
The additional quantum number $m$ appearing in (9.29) characterizes the z component 
of the angular momentum, $L_z = m\hbar$. [The solution of (9.29) will be 
discussed once more, in detail, in Exercise 9.4.] The radial function $R_l(r)$ 
depends on the total angular-momentum quantum number $l$, as can be seen in 
(9.28). We will soon see that the condition of square-integrability for the wave 
function (normalization) calls for another quantum number, the so-called radial 
quantum number $n_r$. 

To find the energy spectrum, it is sufficient to deal with the radial part, 
because the energy E appears only in (9.28). Indeed, since the problem is 
spherically symmetric, the energy can only depend on the radial part R(r) of the 
wave function. (In the classical Kepler problem, the energy depends on the 
distance between the particles.) 


The normalization condition 

$$ \int\psi^*\psi\, dV = 1 \tag{9.30} $$

leads to 

$$ \int_0^\infty R_l^*R_l\, dr = 1 \tag{9.31} $$

because of the separation (9.25) and the orthonormality of the spherical harmonics. 
Here, we only determine the (discrete) bound states, which are characterized 
by negative energy eigenvalues¹. 

To find a suitable substitution for solving the differential equation (9.28), it is 
useful to consider first the limits $r \to 0$ and $r \to \infty$. For $r \to 0$, the term containing 
the angular momentum is dominant and one gets the equation 

$$ \frac{d^2R_l}{dr^2} - \frac{l(l+1)}{r^2}R_l = 0 . \tag{9.32} $$

Trying a power series $R_l = r^\alpha(1 + a_1r + a_2r^2 + \ldots)$ and neglecting the higher-order 
terms, only the following lowest-order term remains: 

$$ \alpha(\alpha - 1)r^{\alpha-2} - l(l+1)r^{\alpha-2} = 0 . \tag{9.33} $$

The solutions for $\alpha$ are then $\alpha = l + 1$ and $\alpha = -l$. The case $\alpha = -l$ leads, as 
does the three-dimensional oscillator (cf. Exercise 7.2), to the same solutions as 
the case $\alpha = l + 1$. 

¹ A discussion of the solutions of the continuum ($E > 0$) can be found e.g. in A.S. 
Davydov: Quantum Mechanics (Pergamon Press, Oxford 1965). 

In the other asymptotic limit ($r \to \infty$), we can approximate (9.28) by 

$$ \frac{d^2R_l}{dr^2} + \frac{2m}{\hbar^2}ER_l = 0 . \tag{9.34} $$

The abbreviation 

$$ \gamma^2 = -\frac{2m}{\hbar^2}E \tag{9.35} $$

is often used and logically chosen, since the energies of the bound states should 
be negative. In this case the solution of (9.34) is 

$$ R_l = Ae^{-\gamma r} + Be^{\gamma r} , \tag{9.36} $$

where we have to exclude the second term, because it becomes infinite as $r \to \infty$. 
With the solutions of both extreme ("asymptotic") cases, (9.32) and (9.34), we try 
the substitution 

$$ R_l(r) = r^{l+1}e^{-\gamma r}F(r) . \tag{9.37} $$


After inserting this into (9.28) and writing 

$$ \varrho = 2\gamma r , \qquad k = \frac{me^2}{\hbar^2\gamma} , \tag{9.38} $$

we get 

$$ \varrho\frac{d^2F}{d\varrho^2} + (2l + 2 - \varrho)\frac{dF}{d\varrho} - (l + 1 - k)F = 0 . \tag{9.39} $$

Recalling the mathematical discussion in Chap. 7, we recognize this as 
Kummer's differential equation. The solution is given in (18) of Exercise 7.1. We 
exclude the second term of the total solution, as it behaves like $r^{-2l-1}$ 
($r \to 0$), i.e. $R_l \sim r^{-l}$ is always divergent. 

Thus we obtain 

$$ R_l(r) = N r^{l+1}e^{-\gamma r}\,{}_1F_1(l + 1 - k, 2l + 2; 2\gamma r) . \tag{9.40} $$

To be normalizable, the confluent series should end at a certain term; this 
requirement leads to the quantization of the energy. 
Setting 

$$ l + 1 - k = -n_r , \qquad n_r = 0, 1, 2, \ldots , \tag{9.41} $$

we rearrange the terms, getting 

$$ k = n_r + l + 1 = n . \tag{9.42} $$


The number $n$ is the principal quantum number ($n = 1, 2, \ldots$) and is determined 
by the radial quantum number $n_r$ ($n_r = 0, 1, 2, \ldots$) and the angular-momentum 
quantum number $l$ ($l = 0, 1, 2, \ldots$). 
The definitions (9.35) and (9.38) allow us to determine the binding energy: 

$$ E_n = -\frac{me^4}{2\hbar^2}\frac{1}{n^2} = -\frac{e^2}{2a_0}\frac{1}{n^2} , \tag{9.43} $$

where $a_0 = \hbar^2/me^2 = 0.53$ Å is called the Bohr radius. 
When we set $n = 1$, we obtain the binding energy of the hydrogen atom in the 
ground state, 

$$ E_1 = -\frac{me^4}{2\hbar^2} = -13.6\ \text{eV} . \tag{9.44} $$


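The wave functions of the hydrogen atom are 

$$ \psi_{nlm}(r, \vartheta, \varphi) = \frac{R_{nl}(r)}{r}Y_{lm}(\vartheta, \varphi) = N_{nl}\, r^le^{-\gamma_nr}\,{}_1F_1(-n + l + 1, 2l + 2; 2\gamma_nr)\, Y_{lm}(\vartheta, \varphi) , \tag{9.45} $$

where $\gamma_n = me^2/\hbar^2n = 1/na_0$ and with the normalization constant 

$$ N_{nl} = \frac{1}{(2l+1)!}\sqrt{\frac{(n+l)!}{2n(n-l-1)!}}\,(2\gamma_n)^{l+3/2} . \tag{9.46} $$
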
The radial part of the wave function $R_{nl}(r)$ obviously depends on two quantum 
numbers, $n$ and $l$ (or $n_r$ and $l$). The dependence on $l$ results from the separation of 
variables in (9.25), by which the rotational term $l(l+1)/r^2$ was introduced into 
the differential equation (9.28), whereas the dependence on $n$ is caused by the 
eigenvalue equation, which originated from the requirement that the wave 
function be square-integrable [normalization condition (9.31)]. 

The $\psi_{nlm}$ are eigenfunctions of the Schrödinger equation (9.24) belonging 
to the energy eigenvalues $E_n$. Equations (9.41) and (9.42) allow the quantum 
numbers $l$ and $m$ to have the values $0 \le l \le (n - 1)$ and $-l \le m \le l$. Counting 
all possible states of the same energy, we see that every eigenvalue is degenerate 
$n^2$ times: 

$$ \sum_{l=0}^{n-1}(2l + 1) = n^2 . \tag{9.47} $$
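
The counting in (9.47) and the energy formula (9.43) can be checked by direct enumeration. The following is a minimal sketch: the numerical Rydberg energy is a standard table value for $e^2/2a_0$, not a number quoted to this precision in the text, and the helper names are illustrative.

```python
# Enumerate the hydrogen states (l, m) belonging to a principal quantum number n
# and evaluate E_n = -(e^2 / 2 a_0) / n^2.
# Assumption: RYDBERG_EV is the standard value of e^2/(2 a_0) in eV.

RYDBERG_EV = 13.605693


def energy_ev(n):
    """Bound-state energy E_n in eV for principal quantum number n >= 1."""
    if n < 1:
        raise ValueError("n must be a positive integer")
    return -RYDBERG_EV / n**2


def states(n):
    """All (l, m) pairs allowed for a given n: 0 <= l <= n-1 and -l <= m <= l."""
    return [(l, m) for l in range(n) for m in range(-l, l + 1)]


def degeneracy(n):
    """Number of states sharing the energy E_n (spin not included)."""
    return len(states(n))


if __name__ == "__main__":
    for n in range(1, 5):
        print(n, energy_ev(n), degeneracy(n))
```

The enumeration reproduces the degeneracy $n^2$ for every $n$ tried; including the electron spin, which is not part of the wave functions considered here, would double each count.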


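Tables 9.1 and 9.2 present the normalized wave functions for the lowest states 
of the hydrogen atom. In the second table, the wave functions are separated 
into the radial ($R_{nl}(r)/r$) and angular ($Y_{lm}(\vartheta, \varphi)$) parts. The energies ($E_n$) depend 
only on the principal quantum number $n$ and are shown in the last column. 
The energy units are $-e^2/2a_0 = -13.6$ eV, i.e. the ground state energy, and 
$\gamma = 1/na_0$, $a_0 = \hbar^2/me^2 = 0.53$ Å. 

Table 9.1. The wave functions $\psi_{nlm}$ for the lowest states of the Schrödinger hydrogen atom 

n   l   m     $\psi_{nlm}(r, \vartheta, \varphi)$                                                                                   $E_n$
1   0   0     $\frac{1}{\sqrt{\pi}}\gamma^{3/2}e^{-\gamma r}$                                                                      $1$
2   0   0     $\frac{1}{\sqrt{\pi}}\gamma^{3/2}(1 - \gamma r)e^{-\gamma r}$                                                        $\frac{1}{4}$
2   1   0     $\frac{1}{\sqrt{\pi}}\gamma^{5/2}re^{-\gamma r}\cos\vartheta$                                                        $\frac{1}{4}$
2   1   ±1    $\mp\frac{1}{\sqrt{2\pi}}\gamma^{5/2}re^{-\gamma r}\sin\vartheta\, e^{\pm i\varphi}$                                 $\frac{1}{4}$
3   0   0     $\frac{1}{3\sqrt{\pi}}\gamma^{3/2}(3 - 6\gamma r + 2\gamma^2r^2)e^{-\gamma r}$                                       $\frac{1}{9}$
3   1   0     $\sqrt{\frac{2}{3\pi}}\gamma^{5/2}(2 - \gamma r)re^{-\gamma r}\cos\vartheta$                                         $\frac{1}{9}$
3   1   ±1    $\mp\frac{1}{\sqrt{3\pi}}\gamma^{5/2}(2 - \gamma r)re^{-\gamma r}\sin\vartheta\, e^{\pm i\varphi}$                   $\frac{1}{9}$
3   2   0     $\frac{1}{3\sqrt{2\pi}}\gamma^{7/2}r^2e^{-\gamma r}(3\cos^2\vartheta - 1)$                                           $\frac{1}{9}$
3   2   ±1    $\mp\frac{1}{\sqrt{3\pi}}\gamma^{7/2}r^2e^{-\gamma r}\sin\vartheta\cos\vartheta\, e^{\pm i\varphi}$                  $\frac{1}{9}$
3   2   ±2    $\frac{1}{2\sqrt{3\pi}}\gamma^{7/2}r^2e^{-\gamma r}\sin^2\vartheta\, e^{\pm 2i\varphi}$                              $\frac{1}{9}$

Table 9.2. The wave functions in Table 9.1, separated into radial and angular parts 

n   l   m     $R_{nl}(r)/r$                                                                          $Y_{lm}(\vartheta, \varphi)$                                                $E_n$
1   0   0     $2\gamma^{3/2}e^{-\gamma r}$                                                           $\frac{1}{\sqrt{4\pi}}$                                                     $1$
2   0   0     $2\gamma^{3/2}(1 - \gamma r)e^{-\gamma r}$                                             $\frac{1}{\sqrt{4\pi}}$                                                     $\frac{1}{4}$
2   1   0     $\frac{2}{\sqrt{3}}\gamma^{5/2}re^{-\gamma r}$                                         $\sqrt{\frac{3}{4\pi}}\cos\vartheta$                                        $\frac{1}{4}$
2   1   ±1    $\frac{2}{\sqrt{3}}\gamma^{5/2}re^{-\gamma r}$                                         $\mp\sqrt{\frac{3}{8\pi}}\sin\vartheta\, e^{\pm i\varphi}$                  $\frac{1}{4}$
3   0   0     $\frac{2}{3}\gamma^{3/2}(3 - 6\gamma r + 2\gamma^2r^2)e^{-\gamma r}$                   $\frac{1}{\sqrt{4\pi}}$                                                     $\frac{1}{9}$
3   1   0     $\frac{\sqrt{8}}{3}\gamma^{5/2}(2 - \gamma r)re^{-\gamma r}$                           $\sqrt{\frac{3}{4\pi}}\cos\vartheta$                                        $\frac{1}{9}$
3   1   ±1    $\frac{\sqrt{8}}{3}\gamma^{5/2}(2 - \gamma r)re^{-\gamma r}$                           $\mp\sqrt{\frac{3}{8\pi}}\sin\vartheta\, e^{\pm i\varphi}$                  $\frac{1}{9}$
3   2   0     $\frac{4}{3\sqrt{10}}\gamma^{7/2}r^2e^{-\gamma r}$                                     $\sqrt{\frac{5}{16\pi}}(3\cos^2\vartheta - 1)$                              $\frac{1}{9}$
3   2   ±1    $\frac{4}{3\sqrt{10}}\gamma^{7/2}r^2e^{-\gamma r}$                                     $\mp\sqrt{\frac{15}{8\pi}}\sin\vartheta\cos\vartheta\, e^{\pm i\varphi}$    $\frac{1}{9}$
3   2   ±2    $\frac{4}{3\sqrt{10}}\gamma^{7/2}r^2e^{-\gamma r}$                                     $\sqrt{\frac{15}{32\pi}}\sin^2\vartheta\, e^{\pm 2i\varphi}$                $\frac{1}{9}$

Every state with the eigenfunction $\psi_{nlm}$, characterized 
by the three quantum numbers $n$, $l$ and $m$, is an eigenstate of three 
simultaneously measurable quantities: 

1. the energy $E_n = (-me^4/2\hbar^2)(1/n^2)$, 
2. the squared angular momentum $L^2$ and 
3. the projection of the angular momentum on the z axis, $L_z$. 

The principal quantum number $n$ characterizes the energy level $E_n$; the (azimuthal) 
quantum number $l$ indicates the magnitude of the angular momentum 
$L^2$; and the magnetic quantum number $m$ gives the size of the z component of 
angular momentum, $L_z$. Thus the eigenvalues of the three quantities $E_n$, $L^2$ and 
$L_z$ are sufficient to determine the wave function $\psi_{nlm}(r, \vartheta, \varphi)$. 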

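The probability of finding an electron with the wave function $\psi_{nlm}(r, \vartheta, \varphi)$ in 
the volume element $dV = r^2\sin\vartheta\, d\vartheta\, d\varphi\, dr$ is 

$$ w_{nlm}(r, \vartheta, \varphi)\, dV = |\psi_{nlm}(r, \vartheta, \varphi)|^2\, dV . \tag{9.48} $$

If we insert 

$$ \psi_{nlm}(r, \vartheta, \varphi) = \frac{R_{nl}(r)}{r}Y_{lm}(\vartheta, \varphi) , $$

we can write the probability in the following way: 

$$ w_{nlm}(r, \vartheta, \varphi)\, r^2\, dr\, d\Omega = R_{nl}^2(r)\, dr\, |Y_{lm}(\vartheta, \varphi)|^2\, d\Omega . \tag{9.49} $$

Integration over $d\Omega$ yields the probability $w_{nl}(r)\, dr$ of having an electron between 
two spherical surfaces of the radii $r$ and $r + dr$: 

$$ w_{nl}(r)\, dr = R_{nl}^2(r)\, dr . \tag{9.50} $$

For example, in the state $\psi_{100}$, the probability is 

$$ w_{10}(r)\, dr = N_{10}^2\, r^2e^{-2r/a_0}\, dr , \tag{9.51} $$

where $N_{10}$ is the normalization constant (9.46). (Plots of the probability 
$w$ against $r$ are shown in the lower part of Fig. 9.1). 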

The wave functions of the hydrogen atom also describe the states of ions 
with only one electron, such as He⁺, Li²⁺, .... The only difference lies in 
replacing the charge $e^2$ by $Ze^2$ (cf. the last section of this chapter, "Hydrogen-like 
Atoms"). 

If we deal with atoms with a nuclear charge number $Z$ greater than 1, we 
have to replace $a_0$ by $a_0/Z$ and the maximum of the probability approaches the 
nucleus like $1/Z$, i.e. the electron is forced by the stronger Coulomb forces into 
an orbit closer to the nucleus. 

The maximum of the function $R_{nl}^2(r)$, i.e. the most probable distance of the 
electron, is given for the state $\psi_{100}$ by 

$$ r_0 = a_0 = \frac{\hbar^2}{me^2} = 0.53\ \text{Å} . \tag{9.52} $$

This is the classical Bohr radius, since according to classical theory, the electron 
should move around the nucleus on a circle with radius $a_0$. 
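
The statement (9.52) can be verified numerically from the ground-state radial probability density. The sketch below assumes the normalized ground-state function $R_{10}(r) = 2a_0^{-3/2}\, r\, e^{-r/a_0}$ (so that $w_{10} = R_{10}^2$) and works in units of the Bohr radius; the grid search and the midpoint integration are illustrative choices, not the method of the text.

```python
import math

# Radial probability density of the hydrogen ground state, w_10(r) = R_10(r)^2,
# with R_10(r) = 2 * a0**-1.5 * r * exp(-r / a0).
# Assumption: lengths are measured in units of the Bohr radius unless a0 is given.


def w10(r, a0=1.0):
    """Probability density for finding the electron between r and r + dr."""
    return 4.0 / a0**3 * r**2 * math.exp(-2.0 * r / a0)


def most_probable_radius(a0=1.0, r_max=10.0, steps=100000):
    """Locate the maximum of w10 by scanning a uniform grid on (0, r_max * a0]."""
    best_r, best_w = 0.0, 0.0
    for i in range(1, steps + 1):
        r = r_max * a0 * i / steps
        w = w10(r, a0)
        if w > best_w:
            best_r, best_w = r, w
    return best_r


def probability_within(radius, a0=1.0, steps=100000):
    """Midpoint-rule integral of w10 from 0 to radius."""
    h = radius / steps
    return sum(w10((i + 0.5) * h, a0) for i in range(steps)) * h


if __name__ == "__main__":
    print(most_probable_radius())          # close to 1, i.e. r_0 = a_0
    print(most_probable_radius(a0=0.5))    # a_0 -> a_0 / Z with Z = 2
    print(probability_within(1.0))         # chance of finding the electron inside r_0
```

Replacing $a_0$ by $a_0/Z$ moves the maximum inward in proportion to $1/Z$, in line with the remark on hydrogen-like ions above. Note also that only about a third of the probability lies inside the most probable radius.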

With increasing principal quantum number $n$, the maximum of the charge 
distribution shifts away from the nucleus; the electron is less tightly bound. 

Fig. 9.1. The normalized radial functions $R_{nl}/r$ (left-hand side) and the normalized probability densities $w_{nl}$ (right-hand side) of the hydrogen atom for principal quantum numbers $n = 1$, 2 and 3 

According to the radial quantum number $n_r$, there are in general several 
maxima, a principal maximum and some supplementary maxima (see Fig. 9.1). 


9.3 Three-Dimensional Electron Densities 


Looking at the sketches in Fig. 9.2, the question arises why there are nonsym- 
metric states in the spherically symmetric Coulomb potential. Of course, the 
nonspherical symmetry of the wave function is immediately acceptable if a weak 
magnetic field is applied. The distributions shown are cylindrically symmetric to 
the z axis. The prominence of the z axis originates from the choice (orientation) 
of the spherical coordinates. Physically the z axis can be fixed, for example, by 
a (weak) magnetic field. The complete solution of the Schrödinger equation 
(9.23, 9.24) corresponding to the energy eigenvalue $E_n$ is a linear combination of 
all $\psi_{nlm}$, since the wave function $\psi_n$ is $n^2$ times degenerate. Thus, in the absence of 
a magnetic field, we will generally have 

$$ \psi_n = \sum_{l=0}^{n-1}\sum_{m=-l}^{l}a_{nlm}\psi_{nlm} , \tag{9.53} $$

with arbitrary coefficients $a_{nlm}$. In particular, we can construct states $\psi_n$ containing 
the $\psi_{nlm}$ with equal probability. Since the wave functions $\psi_{nlm}$ are orthonormalized, 
the squares of the expansion coefficients are in the latter case the 
reciprocal of the degeneracy factor 

$$ |a_{nlm}|^2 = \frac{1}{n^2} . \tag{9.54} $$

This is true, as already indicated, if, for some physical reason (e.g. a magnetic 
field), none of the components is dominant. 

The superposition of all $\psi_{nlm}$ in (9.53) is in fact a spherically symmetric state, 
which is easy to verify. If a particular direction is selected by external fields, 
degeneracy ceases, and the electron density is anisotropic (e.g. the Stark and 
Zeeman effects). 
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The energy values of (9.43) characterize the energy levels of the hydrogen atom: 


$$ E_n = -\frac{me^4}{2\hbar^2}\frac{1}{n^2} . \tag{9.55} $$

During the transition of an electron from the level $E_n$ to another level $E_{n'}$, the 
atom emits a photon of energy 

$$ \hbar\omega_{nn'} = E_n - E_{n'} \quad \text{(Bohr's frequency condition)} . \tag{9.56} $$

Inserting $E_n$ (or $E_{n'}$) we get 

$$ \omega_{nn'} = \frac{e^4m}{2\hbar^3}\left(\frac{1}{n'^2} - \frac{1}{n^2}\right) \tag{9.57} $$

and the frequency 

$$ \nu_{nn'} = R\left(\frac{1}{n'^2} - \frac{1}{n^2}\right) , \tag{9.58} $$

where $R = me^4/4\pi\hbar^3 = 3.29 \times 10^{15}\ \text{s}^{-1}$ is the Rydberg constant. The quantity 
$E_n/h$ is called the spectral term. The differences between these terms determine 
the transition frequencies. 

Figure 9.3 shows the energy-level diagram of the hydrogen atom and the 
most important transitions. We see that with increasing principal quantum 
number $n$, the differences between energy levels decrease, i.e. 

$$ \lim_{n\to\infty}E_n = 0 \quad \text{and} \quad \lim_{n\to\infty}(E_n - E_{n-1}) = 0 . \tag{9.59} $$

If the energies are positive, the values are arbitrarily close together. This 
continuum describes an ionized atom. The ionization energy is the negative 
binding energy. 
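
The frequency formula (9.58) translates directly into wavelengths of the spectral series. The following is a minimal sketch under stated assumptions: the Rydberg frequency is the standard table value for an infinitely heavy nucleus (so reduced-mass corrections are ignored), the series names are keyed by the lower level $n'$, and the function names are illustrative.

```python
# Transition frequencies nu = R * (1/n'^2 - 1/n^2) and the corresponding wavelengths.
# Assumptions: RYDBERG_HZ is the standard Rydberg frequency for infinite nuclear mass;
# wavelengths are vacuum wavelengths.

RYDBERG_HZ = 3.2898e15       # Rydberg frequency, 1/s
LIGHT_SPEED = 2.99792458e8   # speed of light, m/s

SERIES = {1: "Lyman", 2: "Balmer", 3: "Paschen", 4: "Brackett", 5: "Pfund"}


def frequency(n_upper, n_lower):
    """Frequency in Hz of the photon emitted in the transition n_upper -> n_lower."""
    if not (n_upper > n_lower >= 1):
        raise ValueError("require n_upper > n_lower >= 1")
    return RYDBERG_HZ * (1.0 / n_lower**2 - 1.0 / n_upper**2)


def wavelength_nm(n_upper, n_lower):
    """Vacuum wavelength in nanometres of the same transition."""
    return LIGHT_SPEED / frequency(n_upper, n_lower) * 1e9


if __name__ == "__main__":
    for n_lower, name in SERIES.items():
        first_line = wavelength_nm(n_lower + 1, n_lower)
        print(f"{name}: first line at {first_line:.1f} nm")
```

Within each series the lines crowd together toward the series limit $\nu = R/n'^2$, which reflects the vanishing level spacing expressed by (9.59).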

All frequencies involved in transitions which end on the same lower state 
form a spectral series. The transitions to the ground state $n' = 1$ constitute the 
Lyman series. The frequencies are 

$$ \nu_{n1} = R\left(\frac{1}{1^2} - \frac{1}{n^2}\right) , \qquad n = 2, 3, \ldots . \tag{9.60} $$

Fig. 9.2a. Section through the electronic density distribution $|\psi|^2$ of several states of the hydrogen atom. The density of the hatching corresponds to the probability density of the electrons 

Fig. 9.2b. Three-dimensional plot of the electronic density distribution $|\psi|^2$ of the lowest states of the hydrogen atom. The radial variables are in units of the Bohr radius (0.53 Å) 

Fig. 9.3. Energy levels and spectral series of the hydrogen atom (energy axis $E$ from $-13.6$ eV at $n = 1$ up to 0; Lyman, Balmer and Paschen series) 

The transitions to states with $n' = 2$, 3, 4 and 5 comprise the Balmer, Ritz-Paschen, 
Brackett and Pfund series. Recently, hydrogen-like atoms in highly 
excited levels up to $n = 100$ have been observed; they are called Rydberg atoms. 
Their diameter is around $10^4$ times larger than the diameter of the ground state.² 


9.5 Currents in the Hydrogen Atom 


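The operator of the current density $\mathbf{j}$ was introduced in Chap. 6 as 

$$ \mathbf{j} = \frac{i\hbar}{2\mu}\left(\psi\nabla\psi^* - \psi^*\nabla\psi\right) . \tag{9.61} $$

Here, $\mu$ denotes the mass of the electron. This letter is used to distinguish it from 
the magnetic quantum number $m$. 

The eigenfunction of the hydrogen atom (9.45) is written as 

$$ \psi_{nlm} = N_{nl}\frac{R_{nl}(r)}{r}P_l^{|m|}(\vartheta)e^{im\varphi} , $$

where $R_{nl}(r)$ is the radial part and $N_{nl}$ the normalization constant [see (9.45) and 
(9.46)]. We use spherical coordinates to facilitate calculation. Then $\nabla$ reads 

$$ \nabla = \mathbf{e}_r\frac{\partial}{\partial r} + \mathbf{e}_\vartheta\frac{1}{r}\frac{\partial}{\partial\vartheta} + \mathbf{e}_\varphi\frac{1}{r\sin\vartheta}\frac{\partial}{\partial\varphi} . \tag{9.62} $$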


2 See also M.L. Littman et al.: Phys. Rev. 20, 2251 (1979). 
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The components of the current density are now 


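$$ j_r^{(nlm)} = \frac{i\hbar}{2\mu}\left(\psi_{nlm}\frac{\partial}{\partial r}\psi_{nlm}^* - \psi_{nlm}^*\frac{\partial}{\partial r}\psi_{nlm}\right) , $$

$$ j_\vartheta^{(nlm)} = \frac{i\hbar}{2\mu}\left(\psi_{nlm}\frac{1}{r}\frac{\partial}{\partial\vartheta}\psi_{nlm}^* - \psi_{nlm}^*\frac{1}{r}\frac{\partial}{\partial\vartheta}\psi_{nlm}\right) , $$

$$ j_\varphi^{(nlm)} = \frac{i\hbar}{2\mu}\left(\psi_{nlm}\frac{1}{r\sin\vartheta}\frac{\partial}{\partial\varphi}\psi_{nlm}^* - \psi_{nlm}^*\frac{1}{r\sin\vartheta}\frac{\partial}{\partial\varphi}\psi_{nlm}\right) . \tag{9.63} $$

Then we get 

$$ \psi_{nlm}\frac{\partial}{\partial r}\psi_{nlm}^* = N_{nl}^2\left(\frac{R_{nl}(r)}{r}P_l^{|m|}(\vartheta)e^{im\varphi}\right)\frac{\partial}{\partial r}\left(\frac{R_{nl}(r)}{r}P_l^{|m|}(\vartheta)e^{-im\varphi}\right) = N_{nl}^2\left(P_l^{|m|}(\vartheta)\right)^2\frac{R_{nl}(r)}{r}\frac{\partial}{\partial r}\left(\frac{R_{nl}(r)}{r}\right) = \psi_{nlm}^*\frac{\partial}{\partial r}\psi_{nlm} \tag{9.64} $$

as well as 

$$ \psi_{nlm}\frac{1}{r}\frac{\partial}{\partial\vartheta}\psi_{nlm}^* = \psi_{nlm}^*\frac{1}{r}\frac{\partial}{\partial\vartheta}\psi_{nlm} , \tag{9.65} $$

since $R_{nl}(r)$ and $P_l^{|m|}(\vartheta)$ are real functions. It follows immediately that 

$$ j_r = j_\vartheta = 0 . \tag{9.66} $$

This is quite reasonable, because a current in a radial direction would cause the 
entire charge either to be collected in the nucleus or to be emitted from the atom 
after a certain period of time. 

The only nonvanishing component of the current is the $\varphi$ component, as this 
is where the only derivative of a complex part of the function enters, according 
to the last equation in (9.63). 

The current density in the $\varphi$ direction is 

$$ j_\varphi = \frac{i\hbar}{2\mu}\left(\psi_{nlm}\frac{1}{r\sin\vartheta}\frac{\partial}{\partial\varphi}\psi_{nlm}^* - \psi_{nlm}^*\frac{1}{r\sin\vartheta}\frac{\partial}{\partial\varphi}\psi_{nlm}\right) = \frac{\hbar m}{\mu r\sin\vartheta}|\psi_{nlm}|^2 , \tag{9.67} $$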


meaning that the azimuthal current is mainly determined by the magnetic 
quantum number $m$, certainly a very reasonable result. The notion of the 
electron circling the nucleus seems intuitively correct, and is based on the Bohr 
model. 

If $d\sigma$ is an area perpendicular to the current direction (see Fig. 9.4), the current $dI_\varphi$ 
passing through this area is 

$$ dI_\varphi = j_\varphi\, d\sigma . \tag{9.68} $$

Fig. 9.4. Calculation of the magnetic moment 

In standard texts on electrodynamics it is shown that the electric current $dI$ 
circling around a plane area $F$ causes a magnetic moment $dM = (F/c)\, dI$.³ Its 
z component in an atom is therefore 

$$ dM_z = \frac{F}{c}\, dI_\varphi = \frac{1}{c}F j_\varphi\, d\sigma . \tag{9.69} $$

We have to multiply the particle current density by the charge $-e$ to get the 
electric current density, which is now needed. 
Since $F = \pi r^2\sin^2\vartheta$, the magnetic moment becomes 

$$ dM_z = -\frac{e\hbar m}{c\mu r\sin\vartheta}|\psi_{nlm}|^2\pi r^2\sin^2\vartheta\, d\sigma , \tag{9.70} $$

i.e. 

$$ dM_z = -\frac{e\hbar m}{c\mu}|\psi_{nlm}|^2\pi r\sin\vartheta\, d\sigma , $$

and finally 

$$ M_z = -\frac{e\hbar m}{2c\mu} , \tag{9.71} $$

since $dV = 2\pi r\sin\vartheta\, d\sigma$ is the volume of the current element through $d\sigma$ and the 
integration over the normalized wave function is 1. As there are no supplementary 
currents in the atom, the magnetic moment is 

$$ M = M_z = -\mu_Bm , \tag{9.72} $$

where $\mu_B = e\hbar/2\mu c$ is the so-called Bohr magneton. The absolute value of the 
maximal magnetic moment is $\mu_Bl$; that of the minimal moment is zero. 

Noting that the z component of the angular momentum has the value 
$L_z = m\hbar$, we get the gyromagnetic factor (for short: g factor) of the electron 
with respect to the angular momentum. It is defined as the ratio of the absolute 
value of the magnetic moment $|M_z|$ divided by the angular momentum in units 
of $\hbar$, i.e. 

$$ g = \frac{|M_z|/\mu_B}{|L_z|/\hbar} , \tag{9.73} $$

and therefore $g = 1$. By definition, the magnetic moment is measured in units of 
$\mu_B$ and the angular momentum is measured in units of $\hbar$, which explains the 
numerator $|M_z|/\mu_B$ and the denominator $|L_z|/\hbar$ in (9.73). 
Since the electron has another angular momentum, the spin, we can define 
another g factor with respect to the spin. This will be done in Chaps. 12 and 13. 
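
The relations (9.72) and (9.73) can be illustrated with numbers. The sketch below assumes SI units, in which the Bohr magneton reads $\mu_B = e\hbar/2m_e$ (the text uses the Gaussian form $e\hbar/2\mu c$), and it uses the free electron mass instead of the reduced mass. The constants are CODATA values and the function names are illustrative only.

```python
# Orbital magnetic moment M_z = -mu_B * m and the orbital g factor of (9.73).
# Assumptions: SI units (mu_B = e*hbar/(2*m_e)) and the free electron mass;
# constants are CODATA values, not taken from the text.

HBAR = 1.054571817e-34         # reduced Planck constant, J s
E_CHARGE = 1.602176634e-19     # elementary charge, C
M_ELECTRON = 9.1093837015e-31  # electron mass, kg


def bohr_magneton():
    """Bohr magneton in J/T."""
    return E_CHARGE * HBAR / (2.0 * M_ELECTRON)


def magnetic_moments(l):
    """z components M_z = -mu_B * m for m = -l, ..., l, in J/T."""
    return [-m * bohr_magneton() for m in range(-l, l + 1)]


def orbital_g_factor(m):
    """Ratio (|M_z| / mu_B) / (|L_z| / hbar) for a state with magnetic quantum number m."""
    if m == 0:
        raise ValueError("the ratio is undefined for m = 0, where M_z = L_z = 0")
    m_z = -m * bohr_magneton()
    l_z = m * HBAR
    return (abs(m_z) / bohr_magneton()) / (abs(l_z) / HBAR)


if __name__ == "__main__":
    print(bohr_magneton())
    print(magnetic_moments(2))
    print(orbital_g_factor(1))
```

The largest moment in a given $l$ multiplet is $\mu_Bl$ in magnitude, and the ratio in (9.73) evaluates to 1 for every nonzero $m$.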
As we see, there indeed exist real electric currents in the atom, similar to 
those which Bohr assumed to be caused by circling electrons. In quantum 
mechanics as well, the semi-classical Bohr model yields a vivid picture of the 
states described in an accurate manner by quantum mechanics. 

³ See J.D. Jackson: Classical Electrodynamics, 2nd ed. (Wiley, New York 1975) and 
W. Greiner: Theoretische Physik, Bd. 3, 4. Aufl. (Harri Deutsch, Thun und Frankfurt 
a.M. 1986). 


9.7 Hydrogen-like Atoms 


Ions or atoms having only one valence electron in the outermost shell can be 
described as hydrogen-like atoms. In Chap. 14 we will see that every electron 
state can be occupied by only one electron because of Pauli's principle. Furthermore, 
in our case, we have not taken into account up to now the spin in the wave 
function. As we shall see, the electron spin can take two values: spin up and spin 
down, with respect to the z axis. Thus a state $\psi_{nlm}$ can be occupied by two 
electrons, as it should be for complete wave functions, instead of one. In 
addition, we have to consider the fact that the inner electrons screen the nuclear 
potential; this effect can be described by an effective nuclear charge number $Z_{\text{eff}}$. 
This number is $Z$ (the number of protons in the nucleus) reduced by the integral 
over the electron density in a sphere of radius $r$: 

$$ Z_{\text{eff}}(r) = Z - \frac{4\pi}{e}\int_0^r\varrho\, r'^2\, dr' = Z - 4\pi\times2\sum_{nlm}\int_0^r|\psi_{nlm}(r')|^2r'^2\, dr' . \tag{9.74} $$

Here, $\varrho$ is the total spatial charge density and the sum is extended over all totally 
occupied shells. In practice, the value of the effective charge number is determined 
(fitted) by experiment. This procedure provides a useful possibility for 
describing the spectra of the alkali atoms. 

Recently, with the development of heavy-ion accelerators, which allow the 
production of, e.g., high-energy uranium ions (up to 1 GeV/nucleon), it has 
become possible to produce "bare" heavy nuclei. For example, uranium nuclei 
without any electrons, or with only one or two electrons, have been observed. 
Clearly, the electron wave function of a 91-fold ionized uranium atom is 
a hydrogen-type wave function. However, relativistic and quantum-electro- 
dynamical effects become important for these large-Z atoms, and this opens up 
a new field of research. 

An iterative method of calculation is offered by the Hartree method. Here, the 
potential of an electron i is the superposition of the central Coulomb potential 
—Ze?/r; and the potential derived from the remaining electrons. This leads to 
a stationary Schrodinger equation of the form 


$$ \left[-\frac{\hbar^2}{2m}\Delta_i - \frac{Ze^2}{r_i} + \sum_{j\neq i}\int\frac{e^2|\psi_j(\mathbf{r}_j)|^2}{|\mathbf{r}_i - \mathbf{r}_j|}\, d^3r_j\right]\psi_i(\mathbf{r}_i) = E_i\psi_i(\mathbf{r}_i) , \qquad i = 1, \ldots, Z . \tag{9.75} $$

The terms in brackets indicate, respectively, the kinetic energy, the Coulomb 
interaction energy of the electron $i$ with the nucleus and the Coulomb interaction 
energies between electron $i$ and all other electrons. 

Thus we get $Z$ coupled differential equations for the various wave functions 
$\psi_i(\mathbf{r}_i)$. Moreover, because of the quadratic terms in $\psi_j$, these equations are 
nonlinear. They can be iteratively solved, starting with the hydrogen wave 
functions. The Hartree method does not lead to very accurate results because 
a quantum-mechanical interaction between two identical particles, the so-called 
exchange interaction, is neglected. This supplementary effect (which we will treat 
for the two-electron atom in Chap. 14) is considered in the Hartree-Fock method. 



9.4 The Angular-Dependent Part of the Hydrogen Wave Function 


We have already introduced the Schrödinger equation for the hydrogen atom 
and separated it into a radial and an angular-dependent part (with the separation 
constant $C$). The differential equation for the angular-dependent part of 
the wave function [see (9.27) and (9.29)] is of the form 

$$ -\Delta_{\vartheta,\varphi}Y(\vartheta, \varphi) = CY(\vartheta, \varphi) . \tag{1} $$


We shall now determine the solutions and the corresponding quantum numbers 
of the differential equation. 
Even though we have already treated the spherical harmonics in Example 
4.9, it will be educational to derive them once more in a somewhat different way. 
The angular-dependent part of the Laplacian is 


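$$ \Delta_{\vartheta,\varphi} = \frac{1}{\sin\vartheta}\frac{\partial}{\partial\vartheta}\left(\sin\vartheta\frac{\partial}{\partial\vartheta}\right) + \frac{1}{\sin^2\vartheta}\frac{\partial^2}{\partial\varphi^2} . \tag{2} $$

To solve (1), we insert this Laplacian: 

$$ \frac{1}{\sin\vartheta}\frac{\partial}{\partial\vartheta}\left(\sin\vartheta\frac{\partial Y}{\partial\vartheta}\right) + \frac{1}{\sin^2\vartheta}\frac{\partial^2Y}{\partial\varphi^2} + CY = 0 . \tag{3} $$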


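The variables $\vartheta$ and $\varphi$ are separated as follows: 

$$ Y(\vartheta, \varphi) = \Theta(\vartheta)\Phi(\varphi) . \tag{4} $$

Multiplied by $\sin^2\vartheta/(\Theta(\vartheta)\Phi(\varphi))$, (3) takes the form: 

$$ \frac{\sin\vartheta}{\Theta(\vartheta)}\frac{\partial}{\partial\vartheta}\left(\sin\vartheta\frac{\partial\Theta(\vartheta)}{\partial\vartheta}\right) + C\sin^2\vartheta = -\frac{1}{\Phi(\varphi)}\frac{\partial^2\Phi(\varphi)}{\partial\varphi^2} . $$

The left-hand side of this equation depends only on $\vartheta$; the right-hand side, only 
on $\varphi$. So we set both sides equal to a constant $K$ and obtain the differential 
equations 

$$ \frac{1}{\sin\vartheta}\frac{d}{d\vartheta}\left(\sin\vartheta\frac{d\Theta(\vartheta)}{d\vartheta}\right) + \left(C - \frac{K}{\sin^2\vartheta}\right)\Theta(\vartheta) = 0 , \tag{5} $$

$$ \frac{d^2\Phi(\varphi)}{d\varphi^2} + K\Phi(\varphi) = 0 . \tag{6} $$

The solution of (6) is 

$$ \Phi(\varphi) = e^{\pm i\sqrt{K}\varphi} . $$

We require that the wave function be single valued. This means 

$$ e^{\pm i\sqrt{K}\varphi} = e^{\pm i\sqrt{K}(\varphi + 2\pi)} = e^{\pm i\sqrt{K}\varphi \pm 2i\sqrt{K}\pi} . $$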
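From this equation we get $K = m^2$ and $m = 0, 1, 2, 3, \ldots$, where $m$ is the magnetic 
quantum number (quantum number of the magnetic moment), as we 
already know. Thus the integer values of $m$ follow from the uniqueness of the 
wave function. 
With the substitution 

$$ t = \cos\vartheta , \qquad \sin\vartheta = \sqrt{1 - t^2} , \qquad \frac{d}{d\vartheta} = -\sin\vartheta\frac{d}{dt} , $$

(5) takes the form: 

$$ \frac{d}{dt}\left[(1 - t^2)\frac{d\Theta}{dt}\right] + \left(C - \frac{m^2}{1 - t^2}\right)\Theta = 0 . \tag{7} $$

To solve (7), we try 

$$ \Theta_m(t) = (1 - t^2)^{m/2}v_m(t) . $$

The equation for $v_m(t)$ then reads 

$$ \frac{d}{dt}\left[(1 - t^2)\frac{d}{dt}\left((1 - t^2)^{m/2}v_m(t)\right)\right] + \left(C - \frac{m^2}{1 - t^2}\right)(1 - t^2)^{m/2}v_m(t) = 0 , $$

from which we obtain, after performing the differentiation and reordering, 

$$ (1 - t^2)v_m''(t) - 2(m + 1)tv_m'(t) + [C - m(m + 1)]v_m(t) = 0 . \tag{8} $$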

Differentiation of (8) yields the same differential equation for $v_m'(t)$, with the 
coefficients $m$ replaced by $m + 1$: 

$$ (1 - t^2)(v_m')'' - 2(m + 2)t(v_m')' + [C - (m + 1)(m + 2)](v_m') = 0 . $$

Thus solutions $v_m' = v_{m+1}$ are possible or, represented by a function $v_0$, 

$$ v_m(t) = \frac{d^mv_0(t)}{dt^m} . \tag{9} $$

In the case $m = 0$, the differential equation (8) reads: 

$$ (1 - t^2)v_0'' - 2tv_0' + Cv_0 = 0 ; \tag{10} $$


this is Legendre's differential equation (see Example 4.8). We try to solve it in the 
vicinity of $t = 0$ by inserting a power series 

$$ v_0(t) = a_0 + a_1t + a_2t^2 + a_3t^3 + \cdots , \tag{11} $$

where $v_0(t = 0) = a_0$ and $v_0'(t = 0) = a_1$. 

To determine the coefficients of the series, we differentiate this twice, term by 
term: 

$$ v_0' = a_1 + 2a_2t + 3a_3t^2 + 4a_4t^3 + \cdots , \tag{12} $$

$$ v_0'' = 2a_2 + 3\times2\,a_3t + 4\times3\,a_4t^2 + \cdots . \tag{13} $$

When we insert (11), (12) and (13) into Legendre's differential equation (10), we find 

$$ \sum_{\nu=2}^{\infty}\nu(\nu - 1)a_\nu t^{\nu-2} - \sum_{\nu=2}^{\infty}\nu(\nu - 1)a_\nu t^\nu - 2\sum_{\nu=1}^{\infty}\nu a_\nu t^\nu + C\sum_{\nu=0}^{\infty}a_\nu t^\nu = 0 . \tag{14} $$

Since the factor $\nu(\nu - 1)$ vanishes for $\nu = 0$ and $\nu = 1$, we can change the 
summation and write 

$$ \sum_{\nu=2}^{\infty}\nu(\nu - 1)a_\nu t^\nu = \sum_{\nu=0}^{\infty}\nu(\nu - 1)a_\nu t^\nu $$

or 

$$ \sum_{\nu=1}^{\infty}2\nu a_\nu t^\nu = \sum_{\nu=0}^{\infty}2\nu a_\nu t^\nu . $$

This simplifies (14) to 

$$ \sum_{\nu=2}^{\infty}\nu(\nu - 1)a_\nu t^{\nu-2} = \sum_{\nu=0}^{\infty}[\nu(\nu + 1) - C]a_\nu t^\nu . $$

To compare the coefficients of the power $t^l$, we have to set $\nu = l + 2$ on the 
left-hand side and $\nu = l$ on the right-hand side. We then obtain 

$$ (l + 2)(l + 1)a_{l+2} = [l(l + 1) - C]a_l . \tag{15} $$

With the recursion formula (15) we are able to evaluate all coefficients from $a_0$ 
and $a_1$ because 

$$ a_{l+2} = \frac{l(l + 1) - C}{(l + 1)(l + 2)}a_l . $$

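By successive insertion of this equation, it can be shown that the coefficients satisfy the general relations:

$$a_{2k} = (-1)^k\,\frac{C\,(C - 2\cdot 3)(C - 4\cdot 5)\cdots[C - (2k-2)(2k-1)]}{(2k)!}\,a_0 ,$$

$$a_{2k+1} = (-1)^k\,\frac{(C - 1\cdot 2)(C - 3\cdot 4)\cdots[C - (2k-1)(2k)]}{(2k+1)!}\,a_1 .$$

The complete solution of the Legendre differential equation is then given by the sum of the two power series:

$$\begin{aligned}
v_0(t) = {}& a_0\left[1 - C\,\frac{t^2}{2!} + C(C - 2\cdot 3)\,\frac{t^4}{4!} - \cdots + (-1)^k\,C(C - 2\cdot 3)\cdots[C - (2k-2)(2k-1)]\,\frac{t^{2k}}{(2k)!} + \cdots\right] \\
&+ a_1\left[t - (C - 1\cdot 2)\,\frac{t^3}{3!} + (C - 1\cdot 2)(C - 3\cdot 4)\,\frac{t^5}{5!} - \cdots + (-1)^k\,(C - 1\cdot 2)(C - 3\cdot 4)\cdots[C - (2k-1)(2k)]\,\frac{t^{2k+1}}{(2k+1)!} + \cdots\right] .
\end{aligned}$$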


Each of these series diverges at $|t| = 1$ if it does not terminate at a certain point. The series can be forced to converge by setting either $a_1 = 0$ and $a_0 \neq 0$, or $a_0 = 0$ and $a_1 \neq 0$.

Furthermore, we choose $C = l(l + 1)$, where $l = 0, 2, 4, \ldots$ in the first case and $l = 1, 3, 5, \ldots$ in the second case. Then only a finite number of coefficients are nonzero, and the power series terminates and hence converges, i.e. we obtain polynomials. These polynomials are the only solutions which are regular at $|t| = 1$ and which can be considered as solutions of the physical problem. They are known as Legendre polynomials $P_l(t)$ $(l = 0, 1, 2, \ldots)$ (see also Example 4.8). They are normalized so that $P_l(1) = 1$, and they satisfy the following orthogonality relation:

$$\int_{-1}^{1} P_l(t)\,P_{l'}(t)\,dt = \frac{2}{2l+1}\,\delta_{ll'} .$$

[Figure: The spherical harmonics $Y_{lm}(\vartheta, \varphi)$ for the lowest values of $l$ and $m$. We have plotted the function $|Y_{lm}(\vartheta, \varphi)|^2$ in spherical coordinates. For a given direction of $\vartheta$ and $\varphi$ in the coordinate system, the distance of the surface to the origin equals the squared absolute value of the spherical harmonics.]

[Figure: A diatomic molecule. The coordinate vectors $\mathbf{r}_1$ and $\mathbf{r}_2$ describe the centres (nuclei) of the two atoms. $\mathbf{R}$ indicates the centre of mass.]


Reversing the various substitutions, the complete solutions of (1) read (up to normalization)

$$Y(\vartheta, \varphi) = Y_{lm}(\vartheta, \varphi) = e^{im\varphi}\,\sin^m\vartheta\,\frac{d^m P_l(\cos\vartheta)}{d(\cos\vartheta)^m} .$$

We see, again, that the angular-dependent part of the hydrogen wave function is represented by the spherical harmonics $Y_{lm}$.
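The recursion formula (15) lends itself to a direct numerical check. The following is a minimal sketch, assuming exact rational arithmetic and the choice $C = l(l+1)$ made above; the function names `legendre_coefficients` and `overlap` are illustrative and not part of the text. It builds the polynomial coefficients from (15), normalizes them so that $P_l(1) = 1$, and evaluates the orthogonality integral term by term.

```python
from fractions import Fraction


def legendre_coefficients(l):
    """Coefficients a_0..a_l of P_l(t), built from recursion (15) with C = l(l+1)."""
    C = l * (l + 1)
    a = [Fraction(0)] * (l + 1)
    a[l % 2] = Fraction(1)  # start from a_0 for even l, from a_1 for odd l
    for nu in range(l % 2, l - 1, 2):
        # a_{nu+2} = [nu(nu+1) - C] / [(nu+1)(nu+2)] * a_nu
        a[nu + 2] = Fraction(nu * (nu + 1) - C, (nu + 1) * (nu + 2)) * a[nu]
    norm = sum(a)  # value of the unnormalized polynomial at t = 1
    return [c / norm for c in a]


def overlap(l, lp):
    """Exact integral of P_l(t) * P_lp(t) over [-1, 1]."""
    a, b = legendre_coefficients(l), legendre_coefficients(lp)
    total = Fraction(0)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            if (i + j) % 2 == 0:  # odd powers integrate to zero
                total += ai * bj * Fraction(2, i + j + 1)
    return total


if __name__ == "__main__":
    print(legendre_coefficients(2))   # coefficients of (3t^2 - 1)/2
    print(overlap(2, 2), overlap(1, 3))
```

Because the series terminates for $C = l(l+1)$, the loop stops after finitely many steps; for any other value of $C$ the same recursion would never produce a vanishing coefficient.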


EXAMPLE 


9.5 Spectrum of a Diatomic Molecule 


With the techniques we have developed so far, we now want to determine the spectrum of a diatomic molecule in a qualitative way. The potential between the atoms is assumed to be local and not explicitly time dependent; it is given as a function of the distance between the atoms (see figure),


$$V = V(\mathbf{r}_1, \mathbf{r}_2) .$$


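The Laplacian appearing in the Schrödinger equation has to be applied to the coordinates of both atoms:

$$\Delta_1 = \frac{\partial^2}{\partial x_1^2} + \frac{\partial^2}{\partial y_1^2} + \frac{\partial^2}{\partial z_1^2} , \qquad \Delta_2 = \frac{\partial^2}{\partial x_2^2} + \frac{\partial^2}{\partial y_2^2} + \frac{\partial^2}{\partial z_2^2} .$$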


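Hence, the stationary Schrödinger equation becomes

$$\left[-\frac{\hbar^2}{2m_1}\Delta_1 - \frac{\hbar^2}{2m_2}\Delta_2\right]\psi(\mathbf{r}_1, \mathbf{r}_2) + V(\mathbf{r}_1, \mathbf{r}_2)\,\psi(\mathbf{r}_1, \mathbf{r}_2) = E\,\psi(\mathbf{r}_1, \mathbf{r}_2) . \tag{1}$$

Introducing the centre-of-mass coordinate $\mathbf{R}$ and the relative coordinate $\mathbf{r}$, the two-body problem can be reduced to an equivalent one-body problem. The following relations hold:

$$M\mathbf{R} = m_1\mathbf{r}_1 + m_2\mathbf{r}_2 \quad \text{with the total mass} \quad M = m_1 + m_2 , \tag{2}$$

and

$$\mathbf{r} = \mathbf{r}_1 - \mathbf{r}_2 . \tag{3}$$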


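We also have to express the Laplacian in terms of the new coordinates. Taking the $x$ coordinate as an example, we obtain with the definitions (2) and (3):

$$X = \frac{m_1 x_1 + m_2 x_2}{M} , \qquad x = x_1 - x_2 .$$

Thus, for the derivatives with respect to $x_1$ and $x_2$, we get

$$\frac{\partial}{\partial x_1} = \frac{m_1}{M}\frac{\partial}{\partial X} + \frac{\partial}{\partial x} \qquad \text{and} \qquad \frac{\partial}{\partial x_2} = \frac{m_2}{M}\frac{\partial}{\partial X} - \frac{\partial}{\partial x} .$$

Hence,

$$-\frac{\hbar^2}{2m_1}\frac{\partial^2}{\partial x_1^2} - \frac{\hbar^2}{2m_2}\frac{\partial^2}{\partial x_2^2} = -\frac{\hbar^2}{2M}\frac{\partial^2}{\partial X^2} - \frac{\hbar^2}{2\mu}\frac{\partial^2}{\partial x^2} ,$$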


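where $1/\mu = 1/m_1 + 1/m_2$ defines the reduced mass $\mu$. Analogous results follow for the other components, and the Schrödinger equation (1) takes the form

$$-\frac{\hbar^2}{2M}\Delta_R\,\psi(\mathbf{r}, \mathbf{R}) - \frac{\hbar^2}{2\mu}\Delta_r\,\psi(\mathbf{r}, \mathbf{R}) + V(r)\,\psi(\mathbf{r}, \mathbf{R}) = E\,\psi(\mathbf{r}, \mathbf{R}) .$$

Using $\psi(\mathbf{r}, \mathbf{R}) = f(\mathbf{r})F(\mathbf{R})$ and splitting up the energy into $E = E_r + E_R$, we separate the differential equation into the centre-of-mass motion

$$-\frac{\hbar^2}{2M}\Delta_R F(\mathbf{R}) = E_R F(\mathbf{R}) \tag{4}$$

and the relative motion

$$-\frac{\hbar^2}{2\mu}\Delta_r f(\mathbf{r}) + V(r) f(\mathbf{r}) = E_r f(\mathbf{r}) . \tag{5}$$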


Equation (4) no longer contains the potential; the motion of the centre of mass is free and described by a plane wave:

$$F(\mathbf{R}) = c\,\exp\left(-\frac{i}{\hbar}\mathbf{P}\cdot\mathbf{R}\right) ,$$

where $P^2 = 2ME_R$. This is quite reasonable, since we expect the molecule to move, as a whole, freely in space.

In the equation for the relative motion we perform the usual separation of variables for a central potential:

$$f(\mathbf{r}) = f(r, \vartheta, \varphi) = \frac{R_l(r)}{r}\,Y_{lm}(\vartheta, \varphi) .$$


This leads to the radial equation [see e.g. (9.28)]

$$-\frac{\hbar^2}{2\mu}\frac{d^2 R_l}{dr^2} + W_l(r)\,R_l = E_r R_l , \tag{6}$$

with the effective potential

$$W_l(r) = V(r) + \frac{\hbar^2 l(l+1)}{2\mu r^2} , \tag{7}$$

which is the sum of the true potential $V(r)$ and the rotational energy $L^2/2\mu r^2$, as in classical mechanics. To get a qualitative survey of the energy eigenvalues $E_r$, which constitute the spectrum of the molecule, we assume an acceptable form for the potential.

[Figure: Qualitative form of the potential $V(r)$ between two atoms.]

As illustrated by the figure above, the potential should be repulsive if the two atoms are too close together. At a value $r_0$, it should have a minimum, and for large distances $r$, it should be attractive and tend towards zero. The repulsion for values $r < r_0$ is produced by the practically "naked" (i.e. without electrons) nuclei facing each other. The minimum at $r = r_0$ is caused by the molecular electrons, which move around both centres. [We shall explain this later on in greater detail (see hydrogen molecule in Example 14.5).]

If the molecule has an angular momentum, the repulsive centrifugal potential has to be added. Therefore the minimum becomes less marked and shifts towards larger distances, as shown in the figure below.

[Figure: Qualitative behaviour of the effective potential $W_l(r)$ between two nuclei. The positions of the minima of the $l$-dependent potential are denoted by $r_l$ ($r_0$ for $l = 0$).]

In the case of small oscillations, the energy eigenvalues can be calculated by replacing the potential about the minimum by a parabola. Since the location of the minimum depends on the angular momentum, we call it $r_l$. Now we expand $W_l(r)$ around the point $r_l$:

$$W_l(r) = W_l(r_l) + \left(\frac{dW_l}{dr}\right)_{r=r_l}(r - r_l) + \frac{1}{2}\left(\frac{d^2W_l}{dr^2}\right)_{r=r_l}(r - r_l)^2 + \cdots . \tag{8}$$


The higher terms are neglected, because we consider small oscillations about the equilibrium position, $|r - r_l| \ll r_l$, only. The second derivative can be written as

$$\frac{d^2 W_l(r_l)}{dr^2} = \mu\,\omega_l^2 . \tag{9}$$

This gives us approximately a parabolic potential. Since the first derivative vanishes at the point of equilibrium $r = r_l$, (8) becomes, using the abbreviation $x = r - r_l$,

$$W_l(r) = V(r_l) + \frac{\hbar^2 l(l+1)}{2\mu r_l^2} + \frac{1}{2}\mu\,\omega_l^2 x^2 .$$


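With the moment of inertia $\Theta_l = \mu r_l^2$, the Schrödinger equation (6) changes into

$$\left[-\frac{\hbar^2}{2\mu}\frac{d^2}{dx^2} + V(r_l) + \frac{\hbar^2 l(l+1)}{2\Theta_l} + \frac{1}{2}\mu\,\omega_l^2 x^2\right]R = ER ,$$

where $E$ denotes the energy of the relative motion.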


Accordingly, our approximation leads to the linear harmonic-oscillator equation. This can be easily seen by substituting

$$E' = E - V(r_l) - \frac{\hbar^2 l(l+1)}{2\Theta_l} ,$$

which yields

$$-\frac{\hbar^2}{2\mu}\frac{d^2 R}{dx^2} + \frac{1}{2}\mu\,\omega_l^2 x^2 R = E'R .$$


As we already know, the eigenvalues of the linear harmonic oscillator are given by (see Chap. 7)

$$E'_n = \hbar\omega_l\left(n + \tfrac{1}{2}\right) , \qquad n = 0, 1, 2, \ldots ,$$

and thus the whole spectrum results as

$$E = E_{nl} = V(r_l) + \hbar\omega_l\left(n + \tfrac{1}{2}\right) + \frac{\hbar^2 l(l+1)}{2\Theta_l} . \tag{10}$$

The energy obviously consists of a rotational part $\hbar^2 l(l+1)/2\Theta_l$ and a vibrational part $\hbar\omega_l(n + 1/2)$. In addition, the vibrational frequency $\omega_l$ is determined by the rotation; $\omega_l$ depends on $l$ [see (9)]. Because of our approximation, the solution (10) is valid for small quantum numbers $n$ and $l$ only.
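To make the structure of (10) concrete, the level scheme can be tabulated with a few lines of code. This is only a sketch under simplifying assumptions: the moment of inertia and the vibrational frequency are treated as independent of $l$, and the numerical values of `hbar_omega` and `rot_const` (standing for $\hbar\omega$ and $\hbar^2/2\Theta$) are hypothetical, chosen merely so that the rotational constant is much smaller than the vibrational quantum.

```python
def rovib_energy(n, l, hbar_omega, rot_const, v_min=0.0):
    """E_nl from (10); rot_const stands for hbar^2 / (2 * Theta), taken as l-independent."""
    return v_min + hbar_omega * (n + 0.5) + rot_const * l * (l + 1)


def rotational_spacings(l_max, hbar_omega=100.0, rot_const=1.0, n=0):
    """Differences E(n, l+1) - E(n, l) for l = 0 .. l_max - 1 at fixed n."""
    levels = [rovib_energy(n, l, hbar_omega, rot_const) for l in range(l_max + 1)]
    return [upper - lower for lower, upper in zip(levels, levels[1:])]


if __name__ == "__main__":
    # Rotational spacings within the vibrational ground state (arbitrary units).
    print(rotational_spacings(4))
    # Vibrational spacing at fixed l, for comparison.
    print(rovib_energy(1, 0, 100.0, 1.0) - rovib_energy(0, 0, 100.0, 1.0))
```

With these inputs the rotational spacings grow linearly with $l$, while the vibrational spacing stays fixed at the much larger value $\hbar\omega$.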

The rotations are observed in the far part and the vibrations in the nearer part of the infra-red spectrum. This means that the rotational levels belonging to a given vibrational state lie much more densely than the vibrational levels with different quantum numbers $n$. In other words, the rotational states can be classified according to the vibrational states. The vibrational levels are equidistant, which we see from (10), while the spacings between successive rotational levels are in the ratios 1:2:3:4:..., if the moment of inertia $\Theta_l$ remains constant. The scheme of a rotation-vibration spectrum is shown in the following picture:


[Figure: Rotation-vibration spectrum of a diatomic molecule: level scheme $E(n, l)$.]

Such rotation-vibration spectra also exist for nuclei and probably even for elementary particles; for such problems we refer to the literature.⁴

⁴ See, e.g.: J.M. Eisenberg and W. Greiner: Nuclear Theory 1, Nuclear Models, 3rd ed. (North Holland, Amsterdam 1987).


EXAMPLE


9.6 Jacobi Coordinates


The Jacobi coordinates are a generalization of the relative and centre-of-mass coordinates used in Example 9.5. The latter are suitable for describing a two-body system. But what does the treatment of an N-body problem look like? Jacobi coordinates give the answer.

[Figure: The Jacobi coordinates for four particles, numbered 1, 2, 3, 4. The $\mathbf{r}_i$ are their position vectors. $\mathbf{R}_{12}$, $\mathbf{R}_{123}$, $\mathbf{R}_{1234}$ are the vectors for the centre of mass of the first two, three and four particles, respectively. The Jacobi-coordinate vectors $\boldsymbol{\xi}_i$ point from the centre of mass of the first $i$ particles to the particle $i + 1$.]

First we take the particles 1 and 2 and treat them in the usual way:


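$$\begin{aligned}
\xi_1 &= \frac{m_1 x_1}{m_1} - x_2 = x_1 - x_2 , \\
\eta_1 &= \frac{m_1 y_1}{m_1} - y_2 = y_1 - y_2 , \\
\zeta_1 &= \frac{m_1 z_1}{m_1} - z_2 = z_1 - z_2 .
\end{aligned} \tag{1}$$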


The first Jacobian vector $\boldsymbol{\xi}_1 = \{\xi_1, \eta_1, \zeta_1\}$ is the relative vector between particle 1 and particle 2. The second Jacobian coordinate is now defined as the relative vector between the centre of mass of the first two particles and the third one. The third Jacobian vector connects the fourth particle with the centre of mass of the first three particles, and so on (cf. Sect. 14.2). Therefore we have

$$\begin{aligned}
\xi_1 &= \frac{m_1 x_1}{m_1} - x_2 , \\
\xi_2 &= \frac{m_1 x_1 + m_2 x_2}{m_1 + m_2} - x_3 , \\
&\;\;\vdots \\
\xi_j &= \frac{\sum_{k=1}^{j} m_k x_k}{\sum_{k=1}^{j} m_k} - x_{j+1} , \\
&\;\;\vdots \\
\xi_N &= \frac{1}{M}\sum_{k=1}^{N} m_k x_k = X ,
\end{aligned} \tag{2}$$

and analogous equations are valid for the $\eta_j$ and the $\zeta_j$ components. This is illustrated in the figure below.

Evidently, $\boldsymbol{\xi}_N$ is the centre-of-mass vector of the whole system. Now, we want to transform the kinetic energy operator

$$\hat{T} = -\frac{\hbar^2}{2}\sum_{i=1}^{N}\frac{1}{m_i}\left(\frac{\partial^2}{\partial x_i^2} + \frac{\partial^2}{\partial y_i^2} + \frac{\partial^2}{\partial z_i^2}\right) \tag{3}$$


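to Jacobi coordinates. First we note that from the transformation formulas (2) it follows that

$$\frac{\partial \xi_j}{\partial x_k} = \begin{cases} \dfrac{m_k}{M_j} & \text{for } k \le j , \\[2mm] -1 & \text{for } k = j + 1 , \\[1mm] 0 & \text{for } k > j + 1 , \end{cases} \tag{4}$$

where

$$M_j = \sum_{k=1}^{j} m_k \tag{5}$$

is the total mass of the first $j$ particles. With the help of (4) and (5) we find:

$$\sum_{k=1}^{N}\frac{\partial\psi}{\partial x_k} = \sum_{k=1}^{N}\sum_{j=1}^{N}\frac{\partial\psi}{\partial\xi_j}\frac{\partial\xi_j}{\partial x_k} = \sum_{j=1}^{N}\frac{\partial\psi}{\partial\xi_j}\sum_{k=1}^{N}\frac{\partial\xi_j}{\partial x_k} = \sum_{j=1}^{N}\frac{\partial\psi}{\partial\xi_j}\left(\sum_{k=1}^{j}\frac{m_k}{M_j} + \frac{\partial\xi_j}{\partial x_{j+1}}\right) = \frac{\partial\psi}{\partial\xi_N} . \tag{6}$$

Note that $\left(\sum_{k=1}^{j} m_k/M_j - 1\right) = 0$ for $j < N$. Only in the case $j = N$, where the term $\partial\xi_N/\partial x_{N+1}$ is absent, does the expression in parentheses become nonzero and attain the value 1.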

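The kinetic-energy operator is calculated in an analogous way. It is sufficient to evaluate the operator

$$D_x\psi = \sum_{k=1}^{N}\frac{1}{m_k}\frac{\partial^2\psi}{\partial x_k^2} = \sum_{k=1}^{N}\frac{1}{m_k}\sum_{j=1}^{N}\sum_{j'=1}^{N}\frac{\partial^2\psi}{\partial\xi_j\,\partial\xi_{j'}}\frac{\partial\xi_j}{\partial x_k}\frac{\partial\xi_{j'}}{\partial x_k} . \tag{7}$$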


Then we get, by using (4) and (5),

$$\begin{aligned}
D_x\psi &= \sum_{j=1}^{N}\sum_{j'=1}^{N}\frac{\partial^2\psi}{\partial\xi_j\,\partial\xi_{j'}}\sum_{k=1}^{N}\frac{1}{m_k}\frac{\partial\xi_j}{\partial x_k}\frac{\partial\xi_{j'}}{\partial x_k} \qquad (*) \\
&= \sum_{k=1}^{N}\sum_{j=k}^{N}\frac{m_k}{M_j^2}\frac{\partial^2\psi}{\partial\xi_j^2} + \sum_{j=1}^{N-1}\frac{1}{m_{j+1}}\frac{\partial^2\psi}{\partial\xi_j^2} + 2\sum_{j<j'}\left(\sum_{k=1}^{j}\frac{m_k}{M_j M_{j'}} - \frac{1}{M_{j'}}\right)\frac{\partial^2\psi}{\partial\xi_j\,\partial\xi_{j'}} \\
&= \sum_{j=1}^{N-1}\frac{1}{m_{j+1}}\frac{\partial^2\psi}{\partial\xi_j^2} + \sum_{k=1}^{N}\sum_{j=k}^{N}\frac{m_k}{M_j^2}\frac{\partial^2\psi}{\partial\xi_j^2} .
\end{aligned} \tag{8}$$

The mixed derivatives drop out because of (5): since $\sum_{k=1}^{j} m_k = M_j$, the expression in parentheses vanishes for every pair $j < j'$. Thereby we have changed the order of summation over $k$, $j$ and $j'$ at (*). The last sum transforms into

$$\sum_{k=1}^{N}\sum_{j=k}^{N}\frac{m_k}{M_j^2}\frac{\partial^2\psi}{\partial\xi_j^2} = \sum_{j=1}^{N}\sum_{k=1}^{j}\frac{m_k}{M_j^2}\frac{\partial^2\psi}{\partial\xi_j^2} = \sum_{j=1}^{N}\frac{1}{M_j}\frac{\partial^2\psi}{\partial\xi_j^2} , \tag{9}$$

which means that

$$D_x\psi = \frac{1}{M_N}\frac{\partial^2\psi}{\partial\xi_N^2} + \sum_{j=1}^{N-1}\left(\frac{1}{M_j} + \frac{1}{m_{j+1}}\right)\frac{\partial^2\psi}{\partial\xi_j^2} = \frac{1}{M}\frac{\partial^2\psi}{\partial\xi_N^2} + \sum_{j=1}^{N-1}\frac{1}{\mu_j}\frac{\partial^2\psi}{\partial\xi_j^2} , \tag{10}$$

where $\mu_j$ is the reduced mass of the centre of mass of the first $j$ particles and the particle with the number $j + 1$:

$$\frac{1}{\mu_j} = \frac{1}{M_j} + \frac{1}{m_{j+1}} . \tag{11}$$

Taking into account that

$$D\psi = (D_x + D_y + D_z)\psi , \tag{12}$$

we get from (10) the relation

$$D\psi = \frac{1}{M}\Delta_{\xi_N}\psi + \sum_{j=1}^{N-1}\frac{1}{\mu_j}\Delta_{\xi_j}\psi , \tag{13}$$

with $\Delta_{\xi_j} = \partial^2/\partial\xi_j^2 + \partial^2/\partial\eta_j^2 + \partial^2/\partial\zeta_j^2$. This is the kinetic-energy operator (up to the factor $-\hbar^2/2$), expressed in Jacobi coordinates. As expected, the centre-of-mass motion separates. It is described by the first term on the right-hand side in (13).
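The diagonal form of the kinetic-energy operator in Jacobi coordinates can be verified for concrete masses. The sketch below is an illustration only: it assumes integer masses so that exact fractions can be used, and the helper names `jacobi_matrix` and `inverse_mass_matrix` are made up for this purpose. It builds the derivatives $\partial\xi_j/\partial x_k$ from (4) and forms the coefficients of $\partial^2/\partial\xi_j\,\partial\xi_{j'}$ that appear when (7) is evaluated.

```python
from fractions import Fraction


def jacobi_matrix(masses):
    """Rows j = 1..N of d(xi_j)/d(x_k) according to (4); masses must be integers."""
    n = len(masses)
    rows = []
    for j in range(1, n + 1):
        m_j_total = sum(masses[:j])  # M_j, the total mass of the first j particles
        row = [Fraction(masses[k], m_j_total) if k < j else Fraction(0)
               for k in range(n)]
        if j < n:
            row[j] = Fraction(-1)    # entry for particle j + 1 (0-based index j)
        rows.append(row)
    return rows


def inverse_mass_matrix(masses):
    """G[j][j'] = sum_k (1/m_k) * d(xi_j)/d(x_k) * d(xi_j')/d(x_k)."""
    a = jacobi_matrix(masses)
    n = len(masses)
    return [[sum(a[j][k] * a[jp][k] / masses[k] for k in range(n))
             for jp in range(n)]
            for j in range(n)]


if __name__ == "__main__":
    for row in inverse_mass_matrix([1, 2, 3]):
        print(row)
```

For the masses 1, 2, 3 the off-diagonal entries vanish and the diagonal holds $1/\mu_1$, $1/\mu_2$ and $1/M$, in agreement with (10) and (11).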


9.8 Biographical Notes 


BOHR, Niels Hendrik David, Danish physicist, *Copenhagen 7.10.1885, †Copenhagen 18.11.1962. A professor from 1916, in 1920 B. became director of the Institute of Theoretical Physics at the University of Copenhagen. In 1913 he succeeded in applying Planck's quantum hypothesis (1900) to Rutherford's planetary atomic model. This Bohr model was the first to explain theoretically the spectral series of hydrogen. B. generalized this model to include the description of other elements and developed a theory of the periodic system of the elements. His correspondence principle established a relation between the classical and the new quantum theory. In 1922 he received the Nobel Prize in physics. Together with the young W. Heisenberg, B. developed the "Copenhagen interpretation" of quantum mechanics in 1927, the now prevalent physical interpretation of quantum-theoretical formalism, based on the Heisenberg uncertainty principle and the well-known duality of particles and waves. Later on, he worked on problems of nuclear and elementary particle physics. From 1933 to 1936 he used the "sandbag model" for describing nuclear reactions of collisions. His interpretation of the nuclear fission of uranium was important for its later technical use. From 1943 to 1945, B. worked on the development of the atomic bomb in Los Alamos.


LANDAU, Lew Dawidowitsch, Soviet physicist, *Baku 22.1.1908, †Moscow 1.4.1968, director of the Institute for Theoretical Physics of the Soviet Academy of Sciences. L. investigated, in particular, diamagnetism and low-temperature physics. In 1962 he received the Nobel Prize for his explanation of superfluidity, especially as it appears in He II.


RYDBERG, Janne (John) Robert, Swedish physicist, *Halmstad 8.11.1854, †Lund 28.12.1919. From 1901 on, a professor in Lund, R. worked on the periodic system of the elements and series spectra. In 1889 he submitted his "Recherches sur la constitution des spectres d'émission des éléments" to the Swedish Academy of Sciences. The Rydberg constant and, recently, Rydberg atoms were named after him. In 1913 he published his papers "Elektron, der erste Grundstoff" and "Untersuchungen über das System der Grundstoffe".


LYMAN, Theodore, American physicist, *Boston 23.11.1874, †Cambridge (Mass.) 11.10.1954. From 1910 to 1947, L. was director of the Jefferson Physical Laboratory at Harvard University. He was a pioneer in the field of UV spectroscopy, and in 1906 he discovered a series of the hydrogen atom in the UV range, which was named after him.


BALMER, Johann Jakob, Swiss mathematician, *Lausen (Basel) 1.5.1825, †Basel 12.3.1898. B. taught at the Basel Lady's College and from 1865 until 1890 was also lecturer at the University of Basel. In 1885 he was the first to construct a formula describing those parts of the hydrogen spectrum known at that time (Balmer series). Later he recognized the relation between the constant h of his formula and the Rydberg constant and its significance as the limit of the series.


PASCHEN, Friedrich, German physicist, *Schwerin 22.1.1865, †Potsdam 25.2.1947. P. was professor in Tübingen and in Bonn. In 1924, he became president of the Physikalisch Technische Reichsanstalt Berlin and a professor in Berlin. He constructed very sensitive galvanometers and quadrant electrometers and worked together with C. Runge, mainly in spectroscopic experiments. In 1908 he extended the Balmer formula to the IR lines of the hydrogen spectrum (Paschen series). In 1912/13, together with Back, he discovered the Paschen-Back effect: the splitting of lines in strong magnetic fields. The Paschen Law (1889) states that the ignition voltage of a discharge in a gas depends only on the distance between the electrodes and on the gas pressure.


BRACKETT, F.S., American astronomer. 


HARTREE, Douglas Rayner, British physicist and mathematician, *Cambridge 27.3.1897, †Cambridge 12.2.1958. From 1929-37, H. was a professor of applied mathematics, and then of theoretical physics, at the University of Manchester; from 1946-58, he was a professor in Cambridge. H. became a member of the Royal Society in 1932. His most important achievement was the development of approximation methods for the calculation of quantum-mechanical wave functions of many-electron systems. Furthermore, he worked on problems of digital calculators, ballistics and atmospheric physics. His main publications were Numerical Analysis (1952) and The Calculation of Atomic Structures (1957).


FOCK, Wladimir Alexandrowitsch, Soviet physicist, *Petersburg 22.12.1898. F. became a professor at the University of Leningrad in 1932. In 1939, he joined the Academy of Sciences of the USSR and collaborated at several institutes. F.'s main field of work has been quantum mechanics and quantum electrodynamics; he is one of the founders of the quantum theory of many-particle systems. In 1926 he set up a relativistic equation for particles without spin in magnetic fields, independent of O. Klein. In 1928, together with M. Born, he demonstrated the validity of the adiabatic principle in quantum mechanics, and in 1930 he developed an approximation method for wave equations of many-particle systems (the Hartree-Fock method). From 1932 to 1934, he generalized the Schrödinger equation for systems of variable particle number in so-called "Fock space". He also worked on the interpretation of quantum theory, general relativity ("Theory of space, time and gravitation" 1955), the theory of elasticity, and the theory of refraction and propagation of radio waves.


10. The Mathematical Foundations 
of Quantum Mechanics II 


10.1 Representation Theory 


The state of a particle is completely described by the normalized wave function $\psi(\mathbf{r}, t)$, which we have used until now. In the Schrödinger equation,

$$\left(-\frac{\hbar^2}{2m}\Delta + V(\mathbf{r})\right)\psi(\mathbf{r}, t) = i\hbar\frac{\partial}{\partial t}\psi(\mathbf{r}, t) , \tag{10.1}$$

which gives us the evolution in time of the state, we expressed the momentum operator by the differential operator, i.e.

$$\hat{\mathbf{p}} = -i\hbar\nabla . \tag{10.2}$$

This representation $\psi(\mathbf{r}, t)$ of a particle state is called the coordinate representation. Because of Heisenberg's uncertainty principle, the momentum $\mathbf{p}$ of a particle is not exactly known if its position $\mathbf{r}$ is fixed. According to (3.50), the average momentum is

$$\langle\mathbf{p}\rangle = \int\psi^*(\mathbf{r}, t)\,(-i\hbar\nabla)\,\psi(\mathbf{r}, t)\,dV . \tag{10.3}$$


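We can extract information about the momentum of a particle from the wave function $\psi(\mathbf{r}, t)$ if we expand it in terms of eigenfunctions of the momentum operator; this is simply a Fourier transformation. The Fourier integral reads

$$\psi(\mathbf{r}, t) = \frac{1}{(2\pi\hbar)^{3/2}}\int a(\mathbf{p}, t)\,\exp\left(\frac{i}{\hbar}\mathbf{p}\cdot\mathbf{r}\right)d^3p = \int a(\mathbf{p}, t)\,\psi_{\mathbf{p}}(\mathbf{r})\,d^3p . \tag{10.4}$$

The integration is extended over all of momentum space; the function $a(\mathbf{p}, t)$ is the Fourier transform of $\psi(\mathbf{r}, t)$ at time $t$. The plane waves $\psi_{\mathbf{p}}(\mathbf{r})$ are eigenfunctions of the momentum (see Example 4.4). Indeed, we have

$$\psi_{\mathbf{p}}(\mathbf{r}) = \frac{1}{(2\pi\hbar)^{3/2}}\exp\left(\frac{i}{\hbar}\mathbf{p}\cdot\mathbf{r}\right) , \qquad \hat{\mathbf{p}}\,\psi_{\mathbf{p}}(\mathbf{r}) = -i\hbar\nabla\psi_{\mathbf{p}}(\mathbf{r}) = \mathbf{p}\,\psi_{\mathbf{p}}(\mathbf{r}) . \tag{10.5}$$

Now, by inspection of (10.4) it becomes evident that the function $a(\mathbf{p}, t)$ describes the particle state as completely as the function $\psi(\mathbf{r}, t)$. We call $a(\mathbf{p}, t)$ the momentum representation of the state of the particle. With the reciprocity of the Fourier transformation, it follows from (10.4) that

$$a(\mathbf{p}, t) = \frac{1}{(2\pi\hbar)^{3/2}}\int\psi(\mathbf{r}, t)\,\exp\left(-\frac{i}{\hbar}\mathbf{p}\cdot\mathbf{r}\right)d^3r = \int\psi(\mathbf{r}, t)\,\psi^*_{\mathbf{p}}(\mathbf{r})\,d^3r . \tag{10.6}$$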


Hence, if $\psi(\mathbf{r}, t)$ is known, we can construct $a(\mathbf{p}, t)$ according to (10.6); and, vice versa, if $a(\mathbf{p}, t)$ is known, we are able to construct $\psi(\mathbf{r}, t)$ through (10.4). Analogously, the equivalence of the normalization can easily be shown:

$$\int|\psi(\mathbf{r}, t)|^2\,d^3r = \int|a(\mathbf{p}, t)|^2\,d^3p . \tag{10.7}$$

Indeed, (3.41) expresses this fact for particles within a box. The relation corresponding to (10.3) for the average of the position operator reads

$$\langle\mathbf{r}\rangle = \int a^*(\mathbf{p}, t)\,(i\hbar\nabla_p)\,a(\mathbf{p}, t)\,d^3p , \tag{10.8}$$

where $\nabla_p = (\partial/\partial p_x, \partial/\partial p_y, \partial/\partial p_z)$ is the nabla or del operator in momentum space. Indeed, we can easily calculate with (10.4)


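$$\begin{aligned}
\langle\mathbf{r}\rangle &= \int\psi^*(\mathbf{r}, t)\,\mathbf{r}\,\psi(\mathbf{r}, t)\,d^3r \\
&= \int d^3r\,d^3p\,d^3p'\,a^*(\mathbf{p}, t)\,\psi^*_{\mathbf{p}}(\mathbf{r})\,\mathbf{r}\,a(\mathbf{p}', t)\,\psi_{\mathbf{p}'}(\mathbf{r}) \\
&= \int d^3p\,d^3p'\,a^*(\mathbf{p}, t)\,a(\mathbf{p}', t)\int d^3r\,\psi^*_{\mathbf{p}}(\mathbf{r})\,\mathbf{r}\,\psi_{\mathbf{p}'}(\mathbf{r}) .
\end{aligned} \tag{10.9}$$

Now, by use of the first equation in (10.5), we can replace the vector $\mathbf{r}$ in the space integral by

$$\int d^3r\,\psi^*_{\mathbf{p}}(\mathbf{r})\,\mathbf{r}\,\psi_{\mathbf{p}'}(\mathbf{r}) = \int\psi^*_{\mathbf{p}}(\mathbf{r})\,(-i\hbar\nabla_{p'})\,\psi_{\mathbf{p}'}(\mathbf{r})\,d^3r = -i\hbar\nabla_{p'}\int\psi^*_{\mathbf{p}}(\mathbf{r})\,\psi_{\mathbf{p}'}(\mathbf{r})\,d^3r = -i\hbar\nabla_{p'}\,\delta^3(\mathbf{p} - \mathbf{p}') , \tag{10.10}$$

so that (10.9) becomes

$$\begin{aligned}
\langle\mathbf{r}\rangle &= \int d^3p\,d^3p'\,a^*(\mathbf{p}, t)\,a(\mathbf{p}', t)\,(-i\hbar\nabla_{p'})\,\delta^3(\mathbf{p} - \mathbf{p}') \\
&= \int d^3p\,a^*(\mathbf{p}, t)\left[\left.-i\hbar\,a(\mathbf{p}', t)\,\delta^3(\mathbf{p} - \mathbf{p}')\right|_{|\mathbf{p}'|\to\infty} - \int d^3p'\,(-i\hbar\nabla_{p'})\,a(\mathbf{p}', t)\,\delta^3(\mathbf{p} - \mathbf{p}')\right] \\
&= \int d^3p\,a^*(\mathbf{p}, t)\,(i\hbar\nabla_p)\,a(\mathbf{p}, t) ,
\end{aligned}$$

where the boundary term vanishes.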


The function $a(\mathbf{p}, t)$ represents the momentum distribution of the particle state $\psi(\mathbf{r}, t)$. The absolute square $|a(\mathbf{p}, t)|^2$ gives the probability of finding the particle with definite momentum $\mathbf{p}$, i.e. with the wave function

$$\psi_{\mathbf{p}}(\mathbf{r}) = \frac{1}{(2\pi\hbar)^{3/2}}\exp\left(\frac{i}{\hbar}\mathbf{p}\cdot\mathbf{r}\right) ,$$

in the state $\psi(\mathbf{r}, t)$. Hence, $|a(\mathbf{p}, t)|^2$ is the probability density in momentum space.

Up to now we have based our considerations on the physical point of view that the coordinate wave function $\psi(\mathbf{r}, t)$ of a particle is determined by measuring its spatial distribution. The momentum distribution follows by Fourier transformation. But often in physics we must adopt a reverse approach; for example, in electron scattering experiments, momentum distributions (form factors) are measured. Then the (spatial) charge distribution of a nucleus follows from a Fourier analysis (see, e.g., Example 11.8).

Coordinate representation and momentum representation are equally suited 
to describing the state of a particle. Equations (10.4) and (10.6) allow transition 
from one type of representation to the other. 
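The transition (10.6) from coordinate to momentum representation can be tried out numerically. The following sketch is a one-dimensional analogue, not the three-dimensional formula itself, and it rests on assumptions that are not part of the text: units with $\hbar = 1$, a Gaussian test state $\psi(x) = \pi^{-1/4}e^{-x^2/2}$, and a simple trapezoidal quadrature. For this state the exact amplitude is $a(p) = \pi^{-1/4}e^{-p^2/2}$, so both the transform and the equality of norms (10.7) can be checked.

```python
import cmath
import math

HBAR = 1.0  # assumption: units in which hbar = 1


def psi(x):
    """Normalized Gaussian test state (an illustrative choice)."""
    return math.pi ** -0.25 * math.exp(-0.5 * x * x)


def momentum_amplitude(p, x_max=10.0, n=2000):
    """1D analogue of (10.6): a(p) = (2 pi hbar)^(-1/2) * integral psi(x) exp(-i p x / hbar) dx."""
    dx = 2 * x_max / n
    total = 0.0
    for i in range(n + 1):
        x = -x_max + i * dx
        weight = 0.5 if i in (0, n) else 1.0  # trapezoidal rule
        total += weight * psi(x) * cmath.exp(-1j * p * x / HBAR)
    return total * dx / math.sqrt(2 * math.pi * HBAR)


def momentum_norm(p_max=8.0, n=160):
    """Integral of |a(p)|^2 over p, the 1D analogue of the right-hand side of (10.7)."""
    dp = 2 * p_max / n
    total = 0.0
    for i in range(n + 1):
        p = -p_max + i * dp
        weight = 0.5 if i in (0, n) else 1.0
        total += weight * abs(momentum_amplitude(p)) ** 2
    return total * dp


if __name__ == "__main__":
    exact = math.pi ** -0.25 * math.exp(-0.5 * 0.7 ** 2)
    print(momentum_amplitude(0.7), exact)
    print(momentum_norm())
```

The Gaussian is a convenient test case because it is its own Fourier transform in these units; any other square-integrable $\psi(x)$ could be substituted in `psi`.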

Let us now briefly consider the energy representation. For simplicity we assume the particle to have a discrete energy spectrum with eigenvalues $E_1, E_2, \ldots, E_n, \ldots$ and a corresponding system of orthonormal eigenfunctions $\psi_1, \psi_2, \ldots, \psi_n, \ldots$. The expansion of the general wave function $\psi(\mathbf{r}, t)$ in terms of energy eigenfunctions reads

$$\psi(\mathbf{r}, t) = \sum_n a_n(t)\,\psi_n(\mathbf{r}) , \tag{10.11}$$

where the index $n$ indicates the energy dependence. We can get the expansion coefficients from (10.11) by multiplying it by $\psi_m^*$ and integrating over the whole space:

$$a_m(t) = \int\psi_m^*(\mathbf{r})\,\psi(\mathbf{r}, t)\,d^3r . \tag{10.12}$$

It is clear that the state of the particle is completely determined by the set of $a_n$, i.e. the energy representation. Indeed, $\psi(\mathbf{r}, t)$ and the $a_n(t)$ follow from one another; the transformations are given by (10.11) and (10.12). This is completely analogous to the former situation in which we were able to evaluate $\psi(\mathbf{r}, t)$ from $a(\mathbf{p}, t)$ with (10.4) or $a(\mathbf{p}, t)$ from $\psi(\mathbf{r}, t)$ with (10.6).


EXAMPLE 


10.1 Momentum Distribution of the Hydrogen Ground State 


To apply our formalism we evaluate the momentum distribution of an electron in the ground state of a hydrogen atom. The normalized wave function of this state is given by [$\gamma_n = 1/(na_0)$; see (9.45, 46) and Table 9.1]

$$\psi(\mathbf{r}, t) = \frac{e^{-i\omega t}}{\sqrt{\pi a_0^3}}\exp\left(-\frac{r}{a_0}\right) , \tag{1}$$

where $\omega$ denotes the frequency and $a_0$ Bohr's radius. The momentum representation is given by

$$a(\mathbf{p}, t) = \frac{1}{(2\pi\hbar)^{3/2}}\int\psi(\mathbf{r}, t)\,\exp\left(-\frac{i}{\hbar}\mathbf{p}\cdot\mathbf{r}\right)dV . \tag{2}$$

Inserting (1), we get

$$a(\mathbf{p}, t) = \frac{e^{-i\omega t}}{(2\pi\hbar)^{3/2}\sqrt{\pi a_0^3}}\int\exp\left(-\frac{r}{a_0}\right)\exp\left(-\frac{i}{\hbar}\mathbf{p}\cdot\mathbf{r}\right)dV . \tag{3}$$

To simplify our integral we choose the $z$ axis parallel to the momentum and get in spherical coordinates (see figure)

$$a(p, t) = \frac{e^{-i\omega t}}{\pi^2(2a_0\hbar)^{3/2}}\int_0^\infty r^2\,dr\int_{-1}^{1}d\cos\vartheta\int_0^{2\pi}d\varphi\,\exp\left(-\frac{r}{a_0}\right)\exp\left(-\frac{i}{\hbar}pr\cos\vartheta\right) . \tag{4}$$

[Figure: Special choice of the coordinates to evaluate the integral.]

Angular integration yields

$$a(p, t) = \frac{e^{-i\omega t}}{i\pi p\,a_0\sqrt{2a_0\hbar}}\int_0^\infty\left\{\exp\left[-r\left(\frac{1}{a_0} - \frac{ip}{\hbar}\right)\right] - \exp\left[-r\left(\frac{1}{a_0} + \frac{ip}{\hbar}\right)\right]\right\}r\,dr , \tag{5}$$

and from this equation, it follows directly that

$$a(p, t) = \frac{1}{\pi}\left(\frac{2a_0}{\hbar}\right)^{3/2}\frac{e^{-i\omega t}}{[1 + (p^2a_0^2/\hbar^2)]^2} . \tag{6}$$

The probability density in configuration space is obtained from (1) as

$$|\psi(\mathbf{r}, t)|^2 = \frac{1}{\pi a_0^3}\exp\left(-\frac{2r}{a_0}\right) ,$$

while we obtain the density in momentum space from (6) as

$$|a(p, t)|^2 = \frac{8a_0^3}{\pi^2\hbar^3[1 + (p^2a_0^2/\hbar^2)]^4} . \tag{7}$$

The form of both densities is shown in the figure.

[Figure: Probability distribution in configuration space, $|\psi(r)|^2$ versus $r/a_0$ (top), and in momentum space, $|a(p)|^2$ versus $pa_0/\hbar$ (bottom), for the hydrogen ground state.]

Integration over the momentum density also yields the correct value of one:

$$\int|a(p)|^2\,d^3p = \int_0^\infty|a(p)|^2\,4\pi p^2\,dp = \frac{32}{\pi}\int_0^\infty\frac{x^2\,dx}{(1 + x^2)^4} = 1 . \tag{8}$$

Here we have substituted $pa_0/\hbar = x$. The momentum distribution (7) can be verified experimentally by observing photoelectrons in ionization experiments or by measuring inelastic electron scattering. Relation (7) has been confirmed in such experiments.
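The normalization (8) can also be confirmed by a short numerical quadrature. The sketch below is illustrative only; it assumes the substitution $x = \tan\theta$, which maps the integral onto $\int_0^{\pi/2}\sin^2\theta\cos^4\theta\,d\theta$, and applies Simpson's rule. The function name `momentum_density_norm` is made up for this example.

```python
import math


def momentum_density_norm(n=1000):
    """Evaluate (32/pi) * integral_0^inf x^2 / (1 + x^2)^4 dx via x = tan(theta).

    With dx = d(theta) / cos^2(theta) the integrand becomes sin^2(theta) * cos^4(theta)
    on the finite interval [0, pi/2]; n must be even for Simpson's rule.
    """
    def integrand(theta):
        return math.sin(theta) ** 2 * math.cos(theta) ** 4

    h = (math.pi / 2) / n
    total = integrand(0.0) + integrand(math.pi / 2)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * integrand(i * h)
    return 32 / math.pi * total * h / 3


if __name__ == "__main__":
    print(momentum_density_norm())
```

The substitution avoids truncating the infinite integration range, so the only numerical error is that of the quadrature rule itself.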


10.2 Representation of Operators 


The operator equation

$$\varphi = \hat{L}\psi \tag{10.13}$$

transforms a function $\psi$ to another function $\varphi$. For an explicit calculation we have to choose a certain form of representation. Until now we have used the coordinates $\mathbf{r}$ to express an operator; i.e. we have been working in coordinate representation. In this case, the operator $\hat{L}$ generally takes the form

$$\hat{L} = \hat{L}(\hat{\mathbf{r}}, \hat{\mathbf{p}}) = \hat{L}(\mathbf{r}, -i\hbar\nabla) . \tag{10.14}$$

If we change the representation of the wave function, we have to transform the operator accordingly.

Let us consider first the energy representation. We expand the wave functions $\psi(\mathbf{r})$ and $\varphi(\mathbf{r})$ in (10.13) in terms of eigenfunctions of the energy, i.e. of the Hamiltonian ($\hat{H}\psi_n = E_n\psi_n$). Thus we have

$$\psi(\mathbf{r}) = \sum_n a_n\psi_n(\mathbf{r}) \qquad \text{and} \qquad \varphi(\mathbf{r}) = \sum_n b_n\psi_n(\mathbf{r}) . \tag{10.15}$$

The energy representation of the functions $\psi$ and $\varphi$ is then given, according to the previous section, by the set of coefficients $a_n$ and $b_n$, respectively.


To get the energy representation of the operator $\hat{L}$, we insert the expansions (10.15) into (10.13), yielding

$$\sum_n b_n\psi_n = \hat{L}\sum_n a_n\psi_n = \sum_n a_n\hat{L}\psi_n .$$

After multiplication by $\psi_m^*$ and integration, it follows from this equation that

$$\sum_n b_n\delta_{mn} = \sum_n a_n\int\psi_m^*\hat{L}\psi_n\,dV . \tag{10.16}$$

This suggests introducing the matrix element

$$L_{mn} = \int\psi_m^*\hat{L}\psi_n\,dV \tag{10.17}$$

as an abbreviation, so that we can now write for (10.16):

$$b_m = \sum_n L_{mn}a_n . \tag{10.18}$$

This equation is the energy representation of (10.13). The total set of the $L_{mn}$, i.e. the matrix $(L_{mn})$, constitutes the energy representation of $\hat{L}$. As already indicated, because of the two indices, the $L_{mn}$ are combined into a matrix. Since both indices run over the same set of numbers and because the number of energy eigenvalues is infinite, the matrix $(L_{mn})$ is a square infinite matrix.

To give an example of an operator with continuous eigenvalues, we shall now compute the momentum representation of (10.13). The problem is simplified by considering the one-dimensional case ($\mathbf{r} \to x$, $\mathbf{p} \to p_x = p$).

Since we are looking for the momentum representation, we expand in terms of momentum eigenfunctions, i.e.

$$\psi_p(x) = \frac{1}{\sqrt{2\pi\hbar}}\exp\left(\frac{i}{\hbar}px\right) ,$$

but now we do not write them explicitly, and have

$$\psi(x) = \int a(p)\,\psi_p(x)\,dp , \qquad \varphi(x) = \int b(p)\,\psi_p(x)\,dp . \tag{10.19}$$

The functions $a(p)$ and $b(p)$ are the momentum representations of $\psi$ and $\varphi$, respectively. Inserting (10.19) into (10.13) yields

$$\int b(p)\,\psi_p(x)\,dp = \hat{L}\int a(p)\,\psi_p(x)\,dp = \int a(p)\,\hat{L}\psi_p(x)\,dp . \tag{10.20}$$


Since the operator $\hat{L}$ is assumed to be given in coordinate representation (10.14), it depends on $x$ and not on $p$. We can therefore write it under the integral. Multiplication by $\psi_{p'}^*(x)$ and integration lead to

$$\int b(p)\,dp\int\psi_{p'}^*(x)\,\psi_p(x)\,dx = \int a(p)\,dp\int\psi_{p'}^*(x)\,\hat{L}\,\psi_p(x)\,dx . \tag{10.21}$$

With the orthogonality relation

$$\int\psi_{p'}^*(x)\,\psi_p(x)\,dx = \delta(p' - p) \tag{10.22}$$

and the abbreviation

$$L_{p'p} = \int\psi_{p'}^*(x)\,\hat{L}\,\psi_p(x)\,dx \tag{10.23}$$

we get the momentum representation of (10.13),

$$b(p') = \int L_{p'p}\,a(p)\,dp . \tag{10.24}$$

The indices $p$ and $p'$ are continuous and hence the matrix element

$$L_{p'p} = L(p', p)$$

is a function of the variables $p$ and $p'$. But the term "matrix element" is also used in this case. The infinite matrix $(L_{p'p})$ is the momentum representation of $\hat{L}$. For explicit calculations see Examples 10.2 and 10.3.

Now we want to review some laws of matrix calculus and show their validity 
for matrix elements of operators. 

A matrix of the form $L = (L_{mm}\delta_{mn})$ is called diagonal; in particular, for $L_{mm} = 1$ we have the unit matrix $E = (\delta_{mn})$. The matrix which is called the complex conjugate matrix of $L$ is defined by

$$(L^*)_{mn} = (L_{mn})^* . \tag{10.25}$$

The transposed matrix $\tilde{L}$ of $L = (L_{mn})$ is

$$(\tilde{L})_{mn} = L_{nm} . \tag{10.26}$$

It is obtained from the original matrix by transposing the indices, i.e. mirroring the matrix elements on the principal diagonal. The elements of the adjoint matrix $L^+$ fulfil the relation:

$$(L^+)_{mn} = (\tilde{L}^*)_{mn} = L_{nm}^* . \tag{10.27}$$

In the case $L = L^+$, we call the matrix $L$ self-adjoint or Hermitian. Now we show that a Hermitian operator is represented by a Hermitian matrix. Indeed,

$$L_{mn} = \int\psi_m^*\hat{L}\psi_n\,dx = \int(\hat{L}\psi_m)^*\psi_n\,dx = \left(\int\psi_n^*\hat{L}\psi_m\,dx\right)^* = L_{nm}^* = (L^+)_{mn} . \tag{10.28}$$


Two matrices are added, component by component:

$$C_{mn} = A_{mn} + B_{mn} . \tag{10.29}$$

Let $\hat{C}$ be the sum of the operators $\hat{A}$ and $\hat{B}$. We can show that the matrix corresponding to $\hat{C}$ is the sum of the matrices corresponding to $\hat{A}$ and $\hat{B}$:

$$C_{mn} = \int\psi_m^*\hat{C}\psi_n\,dx = \int\psi_m^*(\hat{A} + \hat{B})\psi_n\,dx = \int\psi_m^*\hat{A}\psi_n\,dx + \int\psi_m^*\hat{B}\psi_n\,dx = A_{mn} + B_{mn} . \tag{10.30}$$

The multiplication of matrices is defined as

$$C_{mn} = \sum_k A_{mk}B_{kn} . \tag{10.31}$$


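Let us prove that the matrices of operators fulfil the same relation. If $\hat{C} = \hat{A}\hat{B}$, then obviously

$$C_{mn} = \int\psi_m^*\hat{C}\psi_n\,dx = \int\psi_m^*\hat{A}\hat{B}\psi_n\,dx = \int\psi_m^*\hat{A}(\hat{B}\psi_n)\,dx . \tag{10.32}$$

We set $(\hat{B}\psi_n) = \varphi_n(x)$ and expand this function in terms of the orthogonal functions $\psi_k(x)$,

$$\varphi_n(x) = \sum_k b_{kn}\psi_k(x) , \tag{10.33}$$

with coefficients $b_{kn}$:

$$b_{kn} = \int\psi_k^*\varphi_n\,dx = \int\psi_k^*\hat{B}\psi_n\,dx = B_{kn} . \tag{10.34}$$

By inserting (10.34) into (10.33) and the result into (10.32), we obtain for $C_{mn}$:

$$C_{mn} = \int\psi_m^*\hat{A}\left(\sum_k b_{kn}\psi_k\right)dx = \int\psi_m^*\sum_k b_{kn}\hat{A}\psi_k\,dx = \sum_k b_{kn}\int\psi_m^*\hat{A}\psi_k\,dx . \tag{10.35}$$

In analogy to (10.34), we define

$$\int\psi_m^*(x)\,\hat{A}\,\psi_k(x)\,dx = A_{mk} \tag{10.36}$$

and verify with

$$C_{mn} = \sum_k b_{kn}A_{mk} = \sum_k A_{mk}B_{kn} \tag{10.37}$$

that the multiplication rule (10.31) is also valid for matrices which belong to operators.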

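In the following, the wave functions $\varphi(x)$ and $\psi(x)$ are represented by the numbers $b_n$ and $a_n$, respectively. We want to transform the equation

$$\varphi = \hat{L}\psi \tag{10.38}$$

into matrix form. Therefore we use the column vectors

$$(a_n) = \begin{pmatrix} a_1 \\ a_2 \\ \vdots \end{pmatrix} , \qquad (b_n) = \begin{pmatrix} b_1 \\ b_2 \\ \vdots \end{pmatrix} . \tag{10.39}$$

Now, we can replace the equation $\varphi = \hat{L}\psi$, which is equivalent to $b_m = \sum_n L_{mn}a_n$, by the matrix equation

$$(b_m) = (L_{mn})(a_n) , \tag{10.40}$$

where $(L_{mn})$ corresponds to the operator $\hat{L}$. Explicitly this equation reads:

$$\begin{pmatrix} b_1 \\ b_2 \\ \vdots \end{pmatrix} = \begin{pmatrix} L_{11} & L_{12} & \cdots \\ L_{21} & L_{22} & \cdots \\ \vdots & \vdots & \ddots \end{pmatrix}\begin{pmatrix} a_1 \\ a_2 \\ \vdots \end{pmatrix} . \tag{10.41}$$

We call $(a_n)$ and $(b_n)$ the representations of the wave functions $\psi$ and $\varphi$, respectively, in the chosen basis $\psi_n$.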

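The expectation value $\langle\psi|\hat{L}|\psi\rangle$ of an operator $\hat{L}$ in the state $\psi(x)$ is easily determined in matrix representation. We have

$$\begin{aligned}
\langle\hat{L}\rangle = \langle\psi(x)|\hat{L}|\psi(x)\rangle &= \int\psi^*(x)\,\hat{L}\,\psi(x)\,dx \\
&= \int dx\sum_{n,m}a_m^*\psi_m^*(x)\,\hat{L}\,a_n\psi_n(x) \\
&= \sum_{n,m}a_m^*L_{mn}a_n \\
&= (a_1^*, a_2^*, \ldots)\,(L_{mn})\begin{pmatrix} a_1 \\ a_2 \\ \vdots \end{pmatrix} .
\end{aligned} \tag{10.42}$$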


The results of this section are quite important. We have learned that in addition 
to coordinate representation, a whole variety of representations exist for ex- 
pressing quantum-mechanical relations. Later on in Chap. 12, this fact will 
prove useful for describing spin. 
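The matrix relations (10.27), (10.40) and (10.42) can be illustrated with a small finite example. The sketch below is purely illustrative: it truncates the basis to two states, and the matrix `L` and the coefficient vector `a` are hypothetical numbers chosen for the demonstration rather than matrix elements of any operator discussed in the text.

```python
def apply_matrix(L, a):
    """b_m = sum_n L_mn a_n, the matrix form (10.40) of phi = L psi."""
    return [sum(L[m][n] * a[n] for n in range(len(a))) for m in range(len(L))]


def expectation(L, a):
    """sum_{m,n} a_m^* L_mn a_n, the expectation value (10.42)."""
    b = apply_matrix(L, a)
    return sum(a[m].conjugate() * b[m] for m in range(len(a)))


def is_hermitian(L, tol=1e-12):
    """Check L_mn = (L_nm)^*, i.e. L equals its adjoint as defined in (10.27)."""
    size = len(L)
    return all(abs(L[m][n] - L[n][m].conjugate()) <= tol
               for m in range(size) for n in range(size))


if __name__ == "__main__":
    L = [[1, 1j], [-1j, 2]]   # hypothetical Hermitian 2x2 matrix
    a = [0.6, 0.8j]           # normalized coefficient vector: 0.36 + 0.64 = 1
    print(is_hermitian(L))
    print(apply_matrix(L, a))
    print(expectation(L, a))
```

For a Hermitian matrix the expectation value comes out real, which mirrors the statement that Hermitian operators are represented by Hermitian matrices and possess real expectation values.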


EXAMPLE


10.2 Momentum Representation of the Operator r 


Let us transform the operator of the x coordinate * = x into momentum 
representation. According to (10.23) it is 


XB. — j Wr (X)XWp(xpdx (1) 


For momentum eigenfunctions we take again the plane waves 
_ exp[i(p,/h)x] 


W(X) a ./2mh 


and get 


Sno = 5g SeaP( ~ iE x xexp (14 x a ; (2) 


10.2 Representation of Operators 


This can be written as a partial derivative with respect to p’: 


eee. | ( i d 
sone pO? Px)x |dx 
ae 


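This is the momentum representation of the operator of the x coordinate in matrix
representation. With the momentum representations $b(p_x)$ and $a(p_x)$ of the
functions $\varphi(x)$ and $\psi(x)$, respectively, the equation $\varphi(x) = x\psi(x)$ becomes

$$b(p_x') = \int x_{p_x' p_x}\,a(p_x)\,dp_x = i\hbar\int \Big(\frac{\partial}{\partial p_x'}\delta(p_x' - p_x)\Big)a(p_x)\,dp_x . \tag{4}$$

Partial integration (with $\partial\delta(p_x' - p_x)/\partial p_x' = -\partial\delta(p_x' - p_x)/\partial p_x$) yields a boundary term, which vanishes, and

$$b(p_x') = -i\hbar\big[\delta(p_x' - p_x)\,a(p_x)\big]_{-\infty}^{\infty} + i\hbar\int \delta(p_x' - p_x)\frac{\partial a(p_x)}{\partial p_x}dp_x = i\hbar\frac{\partial}{\partial p_x'}a(p_x') . \tag{5}$$

Comparing this equation with $\varphi(x) = \hat{x}\psi(x) = x\psi(x)$, which is in coordinate
representation, we note that the coordinate x is replaced by the operator
$i\hbar\,\partial/\partial p_x$; the latter has to be interpreted as the momentum representation of the
operator $\hat{x}$. In the case of the other coordinates y and z, the derivation is similar,
and so we obtain the following momentum representation of $\hat{\boldsymbol{r}}$:

$$\hat{\boldsymbol{r}} = i\hbar\nabla_p , \tag{6}$$

where $\nabla_p$ is the nabla or del operator (gradient) in momentum space.
The following table shows the connection between the momentum and the
coordinate operators:

Representation                                      $\hat{\boldsymbol{r}}$          $\hat{\boldsymbol{p}}$
Configuration space (coordinate representation)     $\boldsymbol{r}$                $-i\hbar\nabla$
Momentum space (momentum representation)            $i\hbar\nabla_p$                $\boldsymbol{p}$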


In coordinate representation, $\hat{\boldsymbol{r}} = \boldsymbol{r}$ is simply an ordinary vector whose
components are numbers; $\hat{\boldsymbol{p}} = -i\hbar\nabla$ is a vector whose components are
differential operators with respect to x. In momentum representation, the situation is
reversed: $\hat{\boldsymbol{r}} = i\hbar\nabla_p$ is a vector whose components are differential operators with
respect to p, while $\hat{\boldsymbol{p}} = \boldsymbol{p}$ is an ordinary vector with numbers as components.
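
The replacement of x by $i\hbar\,\partial/\partial p_x$ can be checked on a grid. The sketch below is an illustration only: the Gaussian wave packet, the grids and the choice $\hbar = 1$ are assumptions made for the example. It transforms $\psi(x)$ and $x\psi(x)$ to momentum space with the plane waves used above and compares the second result with the numerical derivative of the first.

```python
import numpy as np

hbar = 1.0                                   # units with hbar = 1 (assumption)
x = np.linspace(-10.0, 10.0, 2001)
dx = x[1] - x[0]
p = np.linspace(-5.0, 5.0, 201)

psi = np.exp(-0.5 * (x - 1.0) ** 2)          # illustrative wave packet centred at x = 1

# Plane-wave kernel psi_p^*(x) = exp(-i p x / hbar) / sqrt(2 pi hbar)
kernel = np.exp(-1j * np.outer(p, x) / hbar) / np.sqrt(2.0 * np.pi * hbar)

a = kernel @ psi * dx                        # a(p): momentum representation of psi
b = kernel @ (x * psi) * dx                  # b(p): momentum representation of x * psi

b_from_derivative = 1j * hbar * np.gradient(a, p)   # i hbar da/dp by finite differences
max_deviation = np.max(np.abs(b - b_from_derivative))
print(max_deviation)
```

The remaining deviation is the finite-difference error of `np.gradient`; it shrinks when the momentum grid is refined.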




10.3 The Harmonic Oscillator in Momentum Space 

Now we show that the solution of the one-dimensional quantum-mechanical 
oscillator yields the same eigenvalues in momentum space as in configuration 
space. To do this, we replace the Hamiltonian in x-space representation, 


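$$\hat{H} = \frac{\hat{p}^2}{2m} + \frac{m}{2}\omega^2\hat{x}^2 = -\frac{\hbar^2}{2m}\frac{\partial^2}{\partial x^2} + \frac{m}{2}\omega^2 x^2 , \tag{1}$$

by the Hamiltonian in its momentum-space representation (see the table in
Example 10.2):

$$\hat{H} = \frac{p^2}{2m} - \frac{m}{2}\omega^2\hbar^2\frac{\partial^2}{\partial p^2} . \tag{2}$$

The stationary Schrödinger equation $\hat{H}\psi = E\psi$ then reads

$$\Big(\frac{p^2}{2m} - \frac{m}{2}\omega^2\hbar^2\frac{\partial^2}{\partial p^2}\Big)\psi(p) = E\psi(p) , \tag{3}$$

$$\Big(\frac{p^2}{2m} - \frac{m\omega^2\hbar^2}{2}\frac{\partial^2}{\partial p^2} - E\Big)\psi(p) = 0 . \tag{4}$$

We divide both sides of the equation by $m^2\omega^2$ and get, after reordering,

$$\Big(-\frac{\hbar^2}{2m}\frac{\partial^2}{\partial p^2} + \frac{p^2}{2m^3\omega^2} - \frac{E}{m^2\omega^2}\Big)\psi = 0 . \tag{5}$$

With the substitutions

$$E' = \frac{E}{m^2\omega^2} \quad\text{and}\quad \omega' = \frac{1}{m^2\omega} \tag{6}$$

the differential equation (5) becomes

$$-\frac{\hbar^2}{2m}\frac{\partial^2\psi}{\partial p^2} + \frac{m}{2}\omega'^2 p^2\psi - E'\psi = 0 . \tag{7}$$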

This equation takes the well-known form of the oscillator equation in
configuration space [see (7.4)]. Hence, all the conclusions reached in Chap. 7 can be
applied with the result that

$$E' = \hbar\omega'\big(n + \tfrac{1}{2}\big) . \tag{8}$$

By resubstituting $E'$ and $\omega'$ we get

$$E = \hbar\omega\big(n + \tfrac{1}{2}\big) , \tag{9}$$


i.e. the same energy values as in our former calculation in configuration space 
(see Chap. 7). Furthermore, the wave functions are of the same form in both 
representations. 
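
The result (9) can also be reproduced numerically by diagonalizing the momentum-space Hamiltonian (2) on a grid. This is a rough sketch, not part of the original example: the values of `hbar`, `m`, `omega`, the momentum grid and the finite-difference approximation of $\partial^2/\partial p^2$ are all assumptions chosen for illustration.

```python
import numpy as np

hbar, m, omega = 1.0, 2.0, 1.5        # illustrative values
p = np.linspace(-15.0, 15.0, 1001)
dp = p[1] - p[0]

# Second-derivative matrix d^2/dp^2 (central differences, psi = 0 outside the grid)
D2 = (np.diag(np.ones(p.size - 1), -1)
      - 2.0 * np.eye(p.size)
      + np.diag(np.ones(p.size - 1), 1)) / dp**2

# Hamiltonian (2) of this example: H = p^2/(2m) - (m omega^2 hbar^2 / 2) d^2/dp^2
H = np.diag(p**2 / (2.0 * m)) - 0.5 * m * omega**2 * hbar**2 * D2

energies = np.linalg.eigvalsh(H)[:4]              # four lowest eigenvalues
expected = hbar * omega * (np.arange(4) + 0.5)    # formula (9)
print(energies, expected)
```

The lowest eigenvalues of the discretized operator lie close to $\hbar\omega(n + \tfrac12)$; the agreement improves with a finer grid.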


In the following we give the matrix representation (we shall also call it the 
matrix form) of several operators. This will be useful in our later studies. 


Matrix of the Coordinate Operator x in Configuration Space 
We show that the matrix form of the coordinate x in x representation is given by

$$x_{x'x} = x'\,\delta(x - x') . \tag{10.43}$$

Indeed, the laws of matrix multiplication give

$$\varphi(x') = \int x_{x'x}\,\psi(x)\,dx = \int x'\,\delta(x - x')\,\psi(x)\,dx = x'\psi(x') , \tag{10.44}$$

i.e. the matrix for $\hat{x}$ produces the right factor (eigenvalue) $x'$ in the equation
$\varphi(x') = x'\psi(x')$. Hence, we call $x_{x'x} = x'\delta(x - x')$ the matrix form of the
coordinate x in x representation.


Matrix of V(x) in Configuration Space 


Let V(x) be an arbitrary function of the x coordinate. As above, we insert
$V_{x'x} = V(x')\,\delta(x - x')$ into the equation

$$\varphi(x) = V(x)\,\psi(x) \tag{10.45}$$

and get

$$\varphi(x') = \int V_{x'x}\,\psi(x)\,dx = \int V(x')\,\delta(x - x')\,\psi(x)\,dx = V(x')\,\psi(x') . \tag{10.46}$$

Obviously, (10.45) and (10.46) are identical. Hence, $V_{x'x} = V(x')\,\delta(x - x')$ is the
matrix form of the potential in coordinate (x) representation.


Matrix Form of the Momentum Operator 
in Configuration Space (x Representation) 


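This matrix reads

$$p_{x'x} = -i\hbar\frac{\partial}{\partial x'}\delta(x' - x) . \tag{10.47}$$

By way of proof, we insert it into

$$\varphi(x) = \hat{p}\,\psi(x) \tag{10.48}$$

and get

$$\varphi(x') = \int p_{x'x}\,\psi(x)\,dx = -i\hbar\int \frac{\partial}{\partial x'}\delta(x' - x)\,\psi(x)\,dx . \tag{10.49}$$

Partial integration yields

$$\varphi(x') = i\hbar\big[\delta(x' - x)\,\psi(x)\big]_{-\infty}^{\infty} - i\hbar\int \delta(x' - x)\frac{\partial\psi(x)}{\partial x}dx . \tag{10.50}$$

The first term vanishes again and so

$$\varphi(x') = -i\hbar\int \delta(x' - x)\frac{\partial}{\partial x}\psi(x)\,dx = -i\hbar\frac{\partial}{\partial x'}\psi(x') , \tag{10.51}$$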


which is the standard form of (10.48) and verifies our assumption. 
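
On a discrete grid the derivative of the delta function has a simple analogue, namely a central-difference matrix. The sketch below rests on that assumption (the discretization, the grid and the test function with wave number `k0` are illustrative and not taken from the text); it checks that the resulting matrix is Hermitian and that it reproduces $-i\hbar\,\partial\psi/\partial x$.

```python
import numpy as np

hbar = 1.0                                   # units with hbar = 1 (assumption)
x = np.linspace(-10.0, 10.0, 1001)
dx = x[1] - x[0]

# Grid analogue of p_{x'x} = -i hbar d/dx' delta(x' - x): central-difference matrix
D = (np.diag(np.ones(x.size - 1), 1) - np.diag(np.ones(x.size - 1), -1)) / (2.0 * dx)
P = -1j * hbar * D

k0 = 2.0                                     # illustrative wave number
psi = np.exp(1j * k0 * x - 0.5 * x**2)       # Gaussian wave packet
phi_matrix = P @ psi                         # matrix multiplication, cf. (10.49)
phi_exact = -1j * hbar * (1j * k0 - x) * psi # analytic -i hbar dpsi/dx, cf. (10.51)

is_hermitian = np.allclose(P, P.conj().T)
max_error = np.max(np.abs(phi_matrix - phi_exact))
print(is_hermitian, max_error)
```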


10.3 The Eigenvalue Problem 


An important and frequently encountered problem in quantum mechanics
involves finding eigenvalues and eigenfunctions of a given operator $\hat{A}$. If the
operator $\hat{A}$ is given in the representation of its eigenfunctions, the diagonal
elements of the corresponding matrix $A_{nm}$ are just its eigenvalues. Let us develop
methods for finding eigenvalues and eigenfunctions of the operator $\hat{A}$ if it is not
given in its eigenrepresentation.

The eigenfunctions $\psi_a$ of $\hat{A}$ fulfil the equation

$$\hat{A}\psi_a(x) = a\,\psi_a(x) . \tag{10.52}$$

We expand them in terms of functions $\varphi_n$ which are not eigenfunctions of $\hat{A}$:

$$\psi_a(x) = \sum_n c_n^a\,\varphi_n(x) . \tag{10.53}$$

The combination of (10.52) and (10.53) yields

$$\hat{A}\sum_n c_n^a\,\varphi_n = a\sum_n c_n^a\,\varphi_n . \tag{10.54}$$

After multiplying by $\varphi_m^*$ and integrating, we have

$$\sum_n c_n^a A_{mn} = a\,c_m^a , \tag{10.55}$$

where the abbreviation $A_{mn}$ means

$$A_{mn} = \int \varphi_m^*\,\hat{A}\,\varphi_n\,dV . \tag{10.56}$$

Let us now assume $A_{mn}$ is given and the eigenvalues a and the eigenvectors $\{c_n^a\}$
in (10.55) for the given matrix $(A_{mn})$ are to be computed. If we know both, the
eigenvalue problem is solved in any representation, since with $\{c_n^a\}$, we can
construct via (10.53) the eigenfunctions of $\hat{A}$, i.e. $\psi_a(x)$, in x representation, too.
To find the $c_n^a$, it is convenient to write (10.55) in the form

$$\sum_n (A_{mn} - a\,\delta_{mn})\,c_n^a = 0 . \tag{10.57}$$

Obviously (10.57) represents an infinite homogeneous system of equations for
the coefficients $c_n^a$. Such a system has a nontrivial solution if the determinant of
coefficients vanishes, i.e.

$$\det(A_{mn} - a\,\delta_{mn}) = 0 . \tag{10.58}$$


The problem is that, in general, this determinant is infinite. To solve (10.58), we
consider secular determinants of Nth degree:

$$D_N(a) = \begin{vmatrix} A_{11} - a & A_{12} & \cdots & A_{1N} \\ A_{21} & A_{22} - a & \cdots & A_{2N} \\ \vdots & \vdots & \ddots & \vdots \\ A_{N1} & A_{N2} & \cdots & A_{NN} - a \end{vmatrix} = 0 . \tag{10.59}$$

This is a truncation of the expansion (10.53) at a certain value n = N. We check
for convergence by increasing the parameter N. The equation $D_N(a) = 0$ is of
Nth degree and therefore yields N solutions for a. These solutions

$$a_1^{(N)}, a_2^{(N)}, \ldots, a_N^{(N)} \tag{10.60}$$

are all real, because $D_N(a)$ is the determinant of a Hermitian matrix (the operator
$\hat{A}$ is assumed to be Hermitian, as all operators in quantum mechanics associated
with observables should be).

Now we evaluate each eigenvalue $a_i$ for a sequence of increasing
determinants $D_N$ and get a sequence of solutions:

$$a_i^{(N)}, a_i^{(N+1)}, a_i^{(N+2)}, \ldots \to a_i . \tag{10.61}$$

The convergence of this sequence can be explained physically. The matrix
elements $A_{nk}$ measure the correlation between the states $\varphi_n$ and $\varphi_k$. But in the
case $n \gg k$, this connection will be negligible (for example, highly excited states
hardly disturb the ground state). Then the $A_{nk}$ usually get very small and
contribute only very little to the first roots of the secular determinant.
We insert each of the so-calculated $a_i$ into (10.57) and obtain the coefficients
$c_n(a_i)$ and, with (10.53), the eigenfunctions

$$\psi_{a_i} = \sum_n c_n(a_i)\,\varphi_n(x) . \tag{10.62}$$

When the spectra and matrices of the operators are continuous, we get an
integral instead of a sum in (10.57) and this equation becomes a Fredholm
integral equation of the second kind:

$$\int A(\xi', \xi)\,c(\xi)\,d\xi = a\,c(\xi') . \tag{10.63}$$


We shall have to deal with such continuum problems later on in quantum
electrodynamics in the discussion of spontaneous vacuum decay. They also
appear in the decay of bound states into several continua. At this point we shall
not deal with them any further.
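
The truncation procedure of (10.59) to (10.61) can be illustrated with a small numerical experiment. The example below is a sketch under assumptions that are not part of the text: as operator we take an anharmonic oscillator $\hat{H} = \hat{p}^2/2 + x^2/2 + \lambda x^4$ (units $\hbar = m = \omega = 1$), as basis $\varphi_n$ the harmonic-oscillator eigenfunctions, and the helper `truncated_hamiltonian` is a hypothetical name. We follow the lowest root of the secular determinant for increasing N.

```python
import numpy as np

def truncated_hamiltonian(N, lam):
    """N x N block of H = p^2/2 + x^2/2 + lam*x^4 in the oscillator basis (hbar = m = omega = 1)."""
    M = N + 4                                   # padding so that the N x N block of x^4 is exact
    a = np.diag(np.sqrt(np.arange(1, M)), 1)    # annihilation operator, a[n, n+1] = sqrt(n+1)
    x = (a + a.T) / np.sqrt(2.0)                # matrix of the coordinate x
    x4 = np.linalg.matrix_power(x, 4)[:N, :N]
    return np.diag(np.arange(N) + 0.5) + lam * x4

lam = 0.1                                       # illustrative coupling strength
sizes = [2, 4, 8, 16, 32, 64]
lowest = [np.linalg.eigvalsh(truncated_hamiltonian(N, lam))[0] for N in sizes]
for N, a1 in zip(sizes, lowest):
    print(N, a1)                                # sequence a_1^(N), cf. (10.61)
```

Instead of expanding the determinant we let `np.linalg.eigvalsh` find its roots, which is equivalent for a Hermitian matrix. The printed sequence decreases monotonically and settles quickly, in line with the argument that distant basis states contribute little to the lowest roots.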


10.4 Unitary Transformations 


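An operator $\hat{A}$ can be represented by matrices in several ways. Indeed, for any
complete set of wave functions $\psi_n(x)$, we can construct the corresponding
representation of the operator $\hat{A}$ [see (10.17)]. Now we consider the
transformation behaviour of these matrices when changing the representations.

An operator $\hat{A}$ may be given in a representation with a basis of functions
$\psi_n(\boldsymbol{r})$, which are the eigenfunctions of an operator $\hat{L}$ [i.e. $\hat{L}\psi_n(\boldsymbol{r}) = L_n\psi_n(\boldsymbol{r})$].
Then

$$A_{mn} = \int \psi_m^*(\boldsymbol{r})\,\hat{A}\,\psi_n(\boldsymbol{r})\,dV \tag{10.64}$$

is the L representation of the operator $\hat{A}$. On the other hand, a representation of
$\hat{A}$ with eigenfunctions $\varphi_\mu(\boldsymbol{r})$ of $\hat{M}$ [i.e. $\hat{M}\varphi_\mu(\boldsymbol{r}) = M_\mu\varphi_\mu(\boldsymbol{r})$] is also possible:

$$A_{\mu\nu} = \int \varphi_\mu^*(\boldsymbol{r})\,\hat{A}\,\varphi_\nu(\boldsymbol{r})\,dV . \tag{10.65}$$

To distinguish between these representations, we use Latin indices for the
L representation and Greek indices for the M representation.

Now we want to determine the transformation matrix which connects $(A_{mn})$
with $(A_{\mu\nu})$. Therefore we expand the eigenfunctions of $\hat{M}$ in terms of
eigenfunctions of $\hat{L}$:

$$\varphi_\mu = \sum_n S_{n\mu}\,\psi_n . \tag{10.66}$$

Multiplication by $\psi_m^*$ and integration yield

$$\int \psi_m^*\,\varphi_\mu\,dV = \sum_n S_{n\mu}\,\delta_{mn} = S_{m\mu} . \tag{10.67}$$

Obviously the matrix element $S_{m\mu}$ is the projection of $\psi_m$ onto the state $\varphi_\mu$.
Replacing the M eigenfunctions in (10.65) according to (10.66) leads to

$$A_{\mu\nu} = \int \sum_n S_{n\mu}^*\,\psi_n^*\,\hat{A}\sum_m S_{m\nu}\,\psi_m\,dV = \sum_{n,m} S_{n\mu}^* S_{m\nu}\int \psi_n^*\,\hat{A}\,\psi_m\,dV ,$$

$$A_{\mu\nu} = \sum_{n,m} S_{n\mu}^*\,A_{nm}\,S_{m\nu} . \tag{10.68}$$

Now, using the elements of the adjoint matrix

$$(S^\dagger)_{\mu n} = S_{n\mu}^* \tag{10.69}$$

gives the transformation rule between the matrices of $\hat{A}$ in the two
representations:

$$(A)_{\mu\nu} = \sum_{n,m} (S^\dagger)_{\mu n}\,A_{nm}\,(S)_{m\nu} , \tag{10.70}$$

or, denoting the matrices by capital letters only:

$$A_M = S^\dagger A_L S . \tag{10.71}$$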


The indices M and L refer to the various representations of $\hat{A}$. The requirement
that the $\psi_n$ as well as the $\varphi_\mu$ be orthonormal wave functions implies the unitarity
of S. This is shown in the following derivation:

$$\delta_{\mu\nu} = \int \varphi_\mu^*\,\varphi_\nu\,dV = \sum_{n,m} S_{n\mu}^* S_{m\nu}\int \psi_n^*\,\psi_m\,dV = \sum_{n,m} S_{n\mu}^* S_{m\nu}\,\delta_{nm} ,$$

$$\delta_{\mu\nu} = \sum_n (S^\dagger)_{\mu n}\,S_{n\nu} = (S^\dagger S)_{\mu\nu} , \tag{10.72}$$

i.e. the product of S and its adjoint matrix $S^\dagger$ is equal to the unit matrix:

$$S^\dagger S = 1 . \tag{10.73}$$

Unitarity also means the equivalence of the adjoint $S^\dagger$ and the reciprocal matrix
$S^{-1}$. We note that a unitary matrix is not necessarily Hermitian:

$$S^\dagger = S^{-1} , \quad\text{but in general}\quad S^\dagger \neq S . \tag{10.74}$$

The physical meaning of the unitary transformation (10.66) is the conservation
of probability: if a particle is in a state $\varphi_\mu$ with probability 1, it can be found with
the probability $|S_{n\mu}|^2$ in the states $\psi_n$. The set $|S_{1\mu}|^2, \ldots, |S_{n\mu}|^2, \ldots$ then gives the
probability distribution of the particle with respect to the states $\psi_n$. Therefore it
must hold that

$$\sum_n |S_{n\mu}|^2 = 1 , \tag{10.75}$$

i.e., according to (10.72), S is unitary.

An important and frequently used theorem is the invariance of the trace of
a matrix under unitary transformations. The trace of a matrix A is denoted by
Tr A and is defined as the sum of all diagonal elements. According to (10.68) and
(10.72), we calculate

$$\begin{aligned}
\mathrm{Tr}\,A_M = \sum_\mu A_{\mu\mu} &= \sum_\mu\sum_{n,m} S_{n\mu}^*\,A_{nm}\,S_{m\mu} \\
&= \sum_{\mu,n,m} A_{nm}\,S_{m\mu}\,(S^\dagger)_{\mu n} \\
&= \sum_{n,m} A_{nm}\,\delta_{mn} = \sum_n A_{nn} .
\end{aligned} \tag{10.76}$$

Hence, $\mathrm{Tr}\,A_M = \mathrm{Tr}\,A_L$. Thus the trace of a matrix does not depend on the
particular representation. 
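
The transformation rule (10.70), the unitarity of S and the invariance of the trace are easy to try out for finite matrices. In the following sketch the matrices are random and purely illustrative (the names `A_L`, `M_L` and the dimension `N` are assumptions); S is built from the eigenvectors of a second Hermitian matrix that plays the role of $\hat{M}$ written in the L representation.

```python
import numpy as np

rng = np.random.default_rng(0)
N = 5

# Hermitian matrix A_L (operator A in the L representation); random illustrative data
X = rng.normal(size=(N, N)) + 1j * rng.normal(size=(N, N))
A_L = 0.5 * (X + X.conj().T)

# A second Hermitian matrix stands for M; the columns of S are its eigenvectors
Y = rng.normal(size=(N, N)) + 1j * rng.normal(size=(N, N))
M_L = 0.5 * (Y + Y.conj().T)
_, S = np.linalg.eigh(M_L)          # S[n, mu]: component of phi_mu along psi_n, cf. (10.66)

A_M = S.conj().T @ A_L @ S          # transformation rule (10.70)

unitarity_defect = np.max(np.abs(S.conj().T @ S - np.eye(N)))   # cf. (10.73)
column_norms = np.sum(np.abs(S) ** 2, axis=0)                   # sum_n |S_{n mu}|^2, cf. (10.75)
trace_difference = abs(np.trace(A_M) - np.trace(A_L))           # cf. (10.76)
print(unitarity_defect, column_norms, trace_difference)
```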


10.5 The S Matrix 


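The temporal evolution of a system can be described as a series of unitary
transformations. The operator of this time-evolution transformation will be
denoted by $\hat{S}$; the corresponding matrix is the S matrix (scattering matrix). We
now derive the $\hat{S}$ operator and show some of its properties.
The operator in question has to transform a state at time t = 0 into the state
at time t:

$$\psi(\boldsymbol{r}, t) = \hat{S}(t)\,\psi(\boldsymbol{r}, 0) . \tag{10.77}$$

If we insert $\psi(\boldsymbol{r}, t)$ into the time-dependent Schrödinger equation, we can
determine $\hat{S}$, namely

$$\Big(i\hbar\frac{\partial}{\partial t} - \hat{H}\Big)\hat{S}(t)\,\psi(\boldsymbol{r}, 0) = \Big(i\hbar\frac{\partial\hat{S}}{\partial t} - \hat{H}\hat{S}\Big)\psi(\boldsymbol{r}, 0) = 0 ,$$

$$i\hbar\frac{\partial\hat{S}}{\partial t} - \hat{H}\hat{S} = 0 . \tag{10.78}$$

In the case of $\hat{H}$ not being explicitly time dependent, the following solution
results:

$$\hat{S} = \exp\Big(-\frac{i}{\hbar}\hat{H}t\Big) . \tag{10.79}$$

From (10.77), it follows that $\hat{S}(0) = 1$. Therefore the integration constant in
(10.79) is set equal to 1. If we apply the operator $\hat{S}$ to a function as in (10.77), we
expand the exponential function into a power series:

$$\hat{S} = \exp\Big(-\frac{i}{\hbar}\hat{H}t\Big) = \sum_n \frac{1}{n!}\Big(-\frac{i}{\hbar}\hat{H}t\Big)^n . \tag{10.80}$$

We give special attention to the energy representation in which $\hat{H}$ is diagonal,
$\hat{H}\psi_n = E_n\psi_n$. With

$$\psi(\boldsymbol{r}, 0) = \sum_n a_n\,\psi_n(\boldsymbol{r}) \tag{10.81}$$

we get the temporal evolution of $\psi(\boldsymbol{r}, 0)$ by applying the operator $\hat{S}$ according to
(10.77). This gives

$$\psi(\boldsymbol{r}, t) = \hat{S}\,\psi(\boldsymbol{r}, 0) = \sum_n a_n\,\hat{S}\,\psi_n(\boldsymbol{r}) ,$$

$$\hat{S}\,\psi(\boldsymbol{r}, 0) = \sum_n a_n\exp\Big(-\frac{i}{\hbar}E_n t\Big)\psi_n(\boldsymbol{r}) . \tag{10.82}$$

Obviously the well-known time dependence for stationary states follows. In the
energy representation, $\hat{S}$ is diagonal, as we can see from (10.82):

$$S_{mn} = \int \psi_m^*\,\hat{S}\,\psi_n\,dV = \exp\Big(-\frac{i}{\hbar}E_n t\Big)\delta_{mn} . \tag{10.83}$$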


Equation (10.80) shows, too, that $\hat{S}$ is a unitary operator:

$$\hat{S}^\dagger = \Big[\exp\Big(-\frac{i}{\hbar}\hat{H}t\Big)\Big]^\dagger = \exp\Big(\frac{i}{\hbar}\hat{H}^\dagger t\Big) = \exp\Big(\frac{i}{\hbar}\hat{H}t\Big) = \hat{S}^{-1} , \tag{10.84}$$

because $\hat{H}$ is Hermitian. We now expand the wave function $\psi(\boldsymbol{r}, t)$ with respect
to the eigenfunctions $\varphi_n$ of the operator $\hat{L}$:

$$\psi(\boldsymbol{r}, t) = \sum_n b_n(t)\,\varphi_n(\boldsymbol{r}) . \tag{10.85}$$

If we again describe the temporal evolution by $\hat{S}$, according to (10.77) we obtain

$$\sum_n b_n(t)\,\varphi_n(\boldsymbol{r}) = \sum_n \hat{S}\,b_n(0)\,\varphi_n(\boldsymbol{r}) . \tag{10.86}$$

Multiplication by $\varphi_m^*$ and subsequent integration yield the matrix equation

$$b_m(t) = \sum_n S_{mn}(t)\,b_n(0) , \tag{10.87}$$

where $S_{mn} = \int \varphi_m^*\,\hat{S}\,\varphi_n\,dV$. Now let us consider the special case $b_n(0) = 1$. Then all
other $b_{n'}(0)$, $n' \neq n$, are equal to zero because of normalization. This means that
in the L representation, the particle at time t = 0 is completely in the state $\varphi_n(\boldsymbol{r})$.
We can say that the system is prepared to be initially in state $\varphi_n(\boldsymbol{r})$.
Consequently, (10.87) yields

$$b_m(t) = S_{mn}(t) . \tag{10.88}$$


This is an interesting result with a rather obvious physical interpretation. 

The matrix element $S_{mn}(t)$ then yields the amplitude with which the system
has passed over from state $\varphi_n$ into the state $\varphi_m$ after time t. Or in other words,
the value

$$w(n \to m) = |S_{mn}(t)|^2 \tag{10.89}$$

gives us the transition probability from state $\varphi_n$ into state $\varphi_m$ under the influence
of $\hat{H}$. This relation will play an important role in our subsequent calculations of
transition probabilities of a quantum-mechanical system and in the calculation
of quantum-electrodynamic scattering processes (transitions from an ingoing to
an outgoing state).¹
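
For a finite-dimensional model the S matrix can be written down explicitly. The sketch below assumes a random Hermitian matrix as Hamiltonian, $\hbar = 1$ and an arbitrary time; the helper name `s_matrix` is hypothetical. It builds $\hat{S}(t)$ from the diagonal form (10.83), transforms it back to the original basis, and checks unitarity and the normalization of the transition probabilities (10.89).

```python
import numpy as np

hbar = 1.0                                  # units with hbar = 1 (assumption)
rng = np.random.default_rng(1)
N = 4
X = rng.normal(size=(N, N)) + 1j * rng.normal(size=(N, N))
H = 0.5 * (X + X.conj().T)                  # random Hermitian "Hamiltonian" (illustrative)

def s_matrix(t):
    """S(t) = exp(-i H t / hbar), built from the energy representation (10.83)."""
    E, U = np.linalg.eigh(H)                # columns of U: energy eigenvectors
    return U @ np.diag(np.exp(-1j * E * t / hbar)) @ U.conj().T

S = s_matrix(0.7)
unitarity_defect = np.max(np.abs(S.conj().T @ S - np.eye(N)))   # cf. (10.84)

# Transition probabilities w(n -> m) = |S_mn(t)|^2, cf. (10.89); each column sums to 1
w = np.abs(S) ** 2
column_sums = w.sum(axis=0)
print(unitarity_defect, column_sums)
```

The column sums express that a system prepared in state n is found with certainty in some state m at time t.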


10.6 The Schrödinger Equation in Matrix Form


As an example of the formalism developed so far, we look at the solution of the
Schrödinger equation

$$i\hbar\frac{\partial\psi}{\partial t} = \hat{H}\psi \tag{10.90}$$

and use the energy representation for the wave function, i.e. the
eigenrepresentation of the not explicitly time-dependent Hamiltonian,

$$\hat{H}\psi_n = E_n\psi_n . \tag{10.91}$$

¹ Compare this with the next chapter (time-dependent perturbation theory, golden rule).

Expanding the wave function with respect to eigenfunctions of the Hamiltonian,

$$\psi(\boldsymbol{r}, t) = \sum_n a_n(t)\,\psi_n(\boldsymbol{r}) , \tag{10.92}$$

and inserting it into the Schrödinger equation (10.90) we get

$$i\hbar\sum_n \frac{\partial a_n}{\partial t}\psi_n(\boldsymbol{r}) = \sum_n E_n a_n\,\psi_n(\boldsymbol{r}) , \tag{10.93}$$

where we have also used (10.91). Multiplication by $\psi_m^*$ and integration yield

$$i\hbar\frac{\partial a_m(t)}{\partial t} = E_m a_m(t) . \tag{10.94}$$

Only the fact that we have chosen the energy representation of $\hat{H}$ with
$H_{mn} = E_n\delta_{mn}$ is responsible for the differential equations for $a_m(t)$ not being
coupled. The solution of (10.94) is:

$$a_m(t) = a_m(0)\exp\Big(-\frac{i}{\hbar}E_m t\Big) . \tag{10.95}$$

The amplitudes of the stationary states are time dependent; the integration
constants result from the initial conditions. If we use a representation other than
the energy representation, then (10.99) below, in which we can see the coupling
of the different amplitudes $a_m(t)$ by the matrix elements, is valid.

Now, in a similar way, we want to calculate the temporal change of the mean
value of an operator $\hat{L}$. The mean value is given by

$$\langle\hat{L}\rangle = \int \psi^*\,\hat{L}\,\psi\,dV . \tag{10.96}$$

Inserting the expansion (10.92) in eigenfunctions yields

$$\langle\hat{L}\rangle = \int \sum_m a_m^*(t)\,\psi_m^*(\boldsymbol{r})\,\hat{L}\sum_n a_n(t)\,\psi_n(\boldsymbol{r})\,dV = \sum_{m,n} a_m^*(t)\,L_{mn}\,a_n(t) , \tag{10.97}$$

according to definition (10.17) of the matrix element. Equation (10.97) gives the
mean value of the operator in matrix representation as a function of time. We
take the time derivative of the mean value and get

$$\frac{d\langle\hat{L}\rangle}{dt} = \sum_{m,n}\frac{\partial a_m^*}{\partial t}L_{mn}a_n + \sum_{m,n} a_m^*\frac{\partial L_{mn}}{\partial t}a_n + \sum_{m,n} a_m^* L_{mn}\frac{\partial a_n}{\partial t} . \tag{10.98}$$

The temporal derivatives of the expansion coefficients $a_m(t)$ are now expressed
according to (10.90) and (10.93) by the matrix elements $H_{mk}$ of the Hamiltonian:

$$i\hbar\frac{\partial a_m}{\partial t} = \sum_k H_{mk}\,a_k . \tag{10.99}$$


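This is the Schrödinger equation (10.90) in matrix representation;² it is valid in
any representation. In the energy representation (10.91), especially, it reduces to
the simple decoupled equations (10.94). If we insert (10.99) and the
complex-conjugated formula into $d\langle\hat{L}\rangle/dt$ of (10.98), then we get

$$\frac{d\langle\hat{L}\rangle}{dt} = -\frac{1}{i\hbar}\sum_{m,n,k} H_{mk}^*\,a_k^*\,L_{mn}\,a_n + \sum_{m,n} a_m^*\frac{\partial L_{mn}}{\partial t}a_n + \frac{1}{i\hbar}\sum_{m,n,k} a_m^*\,L_{mn}\,H_{nk}\,a_k . \tag{10.100}$$

As the Hamiltonian is Hermitian, we have

$$H_{mk}^* = H_{km} , \tag{10.101}$$

and with a change of the summation indices [in the first term we
substitute (m, n, k) → (n, k, m)], we are able to summarize as follows:

$$\frac{d\langle\hat{L}\rangle}{dt} = \sum_{m,k} a_m^*\frac{\partial L_{mk}}{\partial t}a_k + \frac{1}{i\hbar}\sum_{m,k} a_m^*\sum_n (L_{mn}H_{nk} - H_{mn}L_{nk})\,a_k . \tag{10.102}$$

According to the rules of matrix multiplication and with the introduction of
operator products, a further simplification is possible, namely

$$\frac{d\langle\hat{L}\rangle}{dt} = \sum_{m,k} a_m^*\Big(\frac{\partial\hat{L}}{\partial t}\Big)_{mk}a_k + \frac{1}{i\hbar}\sum_{m,k} a_m^*\,(\hat{L}\hat{H} - \hat{H}\hat{L})_{mk}\,a_k . \tag{10.103}$$

We combine the double sums and introduce the commutator $[\hat{H}, \hat{L}]_-$ so that

$$\frac{d\langle\hat{L}\rangle}{dt} = \sum_{m,k} a_m^*\Big(\frac{\partial\hat{L}}{\partial t} + \frac{i}{\hbar}[\hat{H}, \hat{L}]_-\Big)_{mk}a_k$$

results. Setting $d\langle\hat{L}\rangle/dt = \langle d\hat{L}/dt\rangle$ and using (10.97), we get the matrix element
of the temporal change of the operator:

$$\Big(\frac{d\hat{L}}{dt}\Big)_{mk} = \Big(\frac{\partial\hat{L}}{\partial t} + \frac{i}{\hbar}[\hat{H}, \hat{L}]_-\Big)_{mk} , \tag{10.104}$$

$$\frac{d\hat{L}}{dt} = \frac{\partial\hat{L}}{\partial t} + \frac{i}{\hbar}[\hat{H}, \hat{L}]_- . \tag{10.105}$$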
We already deduced this result in Chap. 8, and used it to derive Ehrenfest’s 
theorems; now we have it in matrix representation. 
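
As a plausibility check of (10.105), one can compare the numerically differentiated mean value with the mean value of the commutator. The sketch below assumes random Hermitian matrices for $\hat{H}$ and $\hat{L}$ (with $\hat{L}$ not explicitly time dependent), $\hbar = 1$, and hypothetical helper names `amplitudes` and `mean_L`.

```python
import numpy as np

hbar = 1.0                                   # units with hbar = 1 (assumption)
rng = np.random.default_rng(2)
N = 4

def random_hermitian(n):
    X = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
    return 0.5 * (X + X.conj().T)

H, L = random_hermitian(N), random_hermitian(N)   # L has no explicit time dependence
E, U = np.linalg.eigh(H)

a0 = rng.normal(size=N) + 1j * rng.normal(size=N)
a0 /= np.linalg.norm(a0)                          # normalized initial amplitudes

def amplitudes(t):
    """Solution of i hbar da/dt = H a, i.e. (10.99), obtained via the energy representation."""
    return U @ (np.exp(-1j * E * t / hbar) * (U.conj().T @ a0))

def mean_L(t):
    a = amplitudes(t)
    return np.vdot(a, L @ a).real                 # mean value (10.97)

t, dt = 0.3, 1e-5
lhs = (mean_L(t + dt) - mean_L(t - dt)) / (2 * dt)        # numerical d<L>/dt
a = amplitudes(t)
commutator = H @ L - L @ H
rhs = (1j / hbar * np.vdot(a, commutator @ a)).real       # (i/hbar) <[H, L]>
print(lhs, rhs)
```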


² Heisenberg [Z. Phys. 33, 879 (1925)] introduced matrix elements as the
quantum-mechanical analogue to the Fourier amplitudes of classical mechanics. Just as a classical
quantity is determined by its Fourier amplitude, the corresponding quantum-mechanical
quantity should be given by all these matrix elements. Heisenberg did not initially use the
expression “matrix element”; Born and Jordan first ascertained [Z. Phys. 34, 858 (1925)]
that the multiplication law for quantum-mechanical quantities, given by Heisenberg, is
identical to ordinary matrix multiplication. The entire theory was further developed and
became firmly established with the help of matrix calculus [M. Born, W. Heisenberg,
P. Jordan: Z. Phys. 35, 557 (1926)]. The papers referred to here can be considered
fundamental to the development of quantum mechanics.


10.7 The Schrödinger Representation


In our previous description of the dynamical evolution of a physical system we
used time-dependent state functions $\psi(\boldsymbol{r}, t)$. The physical quantities, at least the
not explicitly time-dependent ones, are described by time-independent operators.
We call this type of description the Schrödinger representation or Schrödinger
picture.


10.8 The Heisenberg Representation 


In the Heisenberg representation (Heisenberg picture), the situation is reversed: 
the wave functions are time independent and the dynamical evolution is described 
by time-dependent operators. 

The two representations are completely equivalent in describing a system; 
they lead to the same expectation values, the same spectra, etc. The transition 
from one representation to another one is given by a unitary time-dependent 
transformation, as we will see below. 

To explain the different types of representation (which are sometimes also
called “pictures”, i.e. Schrödinger picture, Heisenberg picture etc.) we look at
a matrix element of an operator $\hat{L}$:

$$L_{mn}(t) = \int \psi_m^*(\boldsymbol{r}, t)\,\hat{L}\,\psi_n(\boldsymbol{r}, t)\,dV . \tag{10.106}$$

For the wave function, we write in energy representation:

$$\psi_n(\boldsymbol{r}, t) = \psi_n(\boldsymbol{r})\exp\Big(-\frac{i}{\hbar}E_n t\Big) . \tag{10.107}$$

The time dependence of the stationary state is given by an exponential factor.
Inserting this into the integral (10.106) yields

$$L_{mn}(t) = \int \psi_m^*(\boldsymbol{r})\exp\Big(\frac{i}{\hbar}E_m t\Big)\hat{L}\,\psi_n(\boldsymbol{r})\exp\Big(-\frac{i}{\hbar}E_n t\Big)dV = \int \psi_m^*(\boldsymbol{r})\,\hat{L}\exp\Big[\frac{i}{\hbar}(E_m - E_n)t\Big]\psi_n(\boldsymbol{r})\,dV ,$$

$$L_{mn}(t) = \int \psi_m^*(\boldsymbol{r})\,\hat{L}_H(t)\,\psi_n(\boldsymbol{r})\,dV . \tag{10.108}$$

Of course, the matrix element has not changed during our manipulations. The
equations (10.106) and (10.108) differ only in that the time dependence is in one
case [(10.106)] in the wave function $\psi(\boldsymbol{r}, t)$; in the other case [(10.108)] in the
operator $\hat{L}_H(t)$. The operator in the Heisenberg representation is thus

$$\hat{L}_H(t) = \hat{L}\exp\Big[\frac{i}{\hbar}(E_m - E_n)t\Big] . \tag{10.109}$$


This is true if the operator is not explicitly time dependent. In the general case
we can describe the transition from Schrödinger to Heisenberg picture by
a unitary transformation. With the operator

$$\hat{S} = \exp\Big(-\frac{i}{\hbar}\hat{H}t\Big) \tag{10.110}$$

we get

$$\psi_H(\boldsymbol{r}) = \hat{S}^{-1}\psi_S(\boldsymbol{r}, t) \tag{10.111}$$

for the wave functions, and for the operators

$$\hat{L}_H(t) = \hat{S}^{-1}(t)\,\hat{L}_S\,\hat{S}(t) , \tag{10.112}$$

where the index H stands for Heisenberg and S for Schrödinger. A comparison
with (10.108) and (10.109) shows the validity of transformation (10.112) in the 
energy representation. 
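
The equivalence of the two pictures can be made concrete for a finite-dimensional model. The sketch below is an illustration under assumptions (random Hermitian matrices for $\hat{H}$ and $\hat{L}_S$, $\hbar = 1$, an arbitrary time `t`); it checks that both pictures give the same mean value and that, in the energy representation, the Heisenberg operator carries the phase factors of (10.109).

```python
import numpy as np

hbar, t = 1.0, 0.9                           # illustrative units and time
rng = np.random.default_rng(3)

def random_hermitian(n):
    X = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
    return 0.5 * (X + X.conj().T)

H, L_S = random_hermitian(4), random_hermitian(4)
E, U = np.linalg.eigh(H)
S = U @ np.diag(np.exp(-1j * E * t / hbar)) @ U.conj().T    # operator (10.110)

psi_H = rng.normal(size=4) + 1j * rng.normal(size=4)
psi_H /= np.linalg.norm(psi_H)          # Heisenberg state = Schroedinger state at t = 0
psi_S = S @ psi_H                       # Schroedinger state at time t, cf. (10.111)
L_H = S.conj().T @ L_S @ S              # (10.112), using S^{-1} = S^dagger

mean_schroedinger = np.vdot(psi_S, L_S @ psi_S).real
mean_heisenberg = np.vdot(psi_H, L_H @ psi_H).real

# Energy representation: (L_H)_mn = L_mn exp(i (E_m - E_n) t / hbar), cf. (10.109)
L_energy = U.conj().T @ L_S @ U
phase = np.exp(1j * (E[:, None] - E[None, :]) * t / hbar)
phase_rule_defect = np.max(np.abs(U.conj().T @ L_H @ U - L_energy * phase))
print(mean_schroedinger, mean_heisenberg, phase_rule_defect)
```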


10.9 The Interaction Representation 


If we have a system whose Hamiltonian splits into an $\hat{H}_0$ part and into an
additional interaction $\hat{V}$,

$$\hat{H} = \hat{H}_0 + \hat{V} , \tag{10.113}$$

we describe it by the so-called interaction representation (or interaction picture).
In this description, both the state functions and the operators are time
dependent. It is obtained from the Schrödinger representation by the unitary
transformation

$$\hat{S}_0 = \exp\Big(-\frac{i}{\hbar}\hat{H}_0 t\Big) . \tag{10.114}$$

Equation (10.114) is analogous to (10.110). As
with (10.111), we get the wave function with

$$\psi_I = \hat{S}_0^{-1}\,\psi_S . \tag{10.115}$$

The operator in the interaction picture is obtained as

$$\hat{L}_I = \hat{S}_0^{-1}\,\hat{L}_S\,\hat{S}_0 , \tag{10.116}$$

which is analogous to (10.112).


10.10 Biographical Notes 


FREDHOLM, Erik Ivar, Swedish mathematician, *Stockholm 7.4.1866, †Mörby
17.8.1927. His famous work on integral equations was published in 1903, in which he
established the fundamentals of the modern theory of this topic. He was awarded the
Wallmark prize of the Swedish Academy of Science and the French Academy’s prize. He
was appointed professor of theoretical physics in Stockholm in 1906.


11. Perturbation Theory


An exact solution of the Schrödinger equation exists only for a few idealized
problems; normally it has to be solved using an approximation method.
Perturbation theory is applied to those cases in which the real system can be
described by a small change in an easily solvable, idealized system. The
Hamiltonian of the system is then of the form

$$\hat{H} = \hat{H}_0 + \varepsilon\hat{W} , \tag{11.1}$$

where $\hat{H}$ and $\hat{H}_0$ do not differ very much from each other. $\hat{H}_0$ is called the
Hamiltonian of the unperturbed system; the perturbation $\varepsilon\hat{W}$ (i.e. the adaptation
to the real system) has to be very small; and $\varepsilon$ is a real parameter which allows the
expansion of wave functions and energies into a power series in $\varepsilon$. The parameter
$\varepsilon$ is also called the smallness (or perturbation) parameter.

In this form we can describe a great number of problems encountered in 
atomic physics, in which the nucleus provides the strong central potential for the 
electrons; further interactions of less strength are described by the perturbation. 
Examples of these additional interactions are: the magnetic interaction (spin- 
orbit coupling), the electrostatic repulsion of electrons and the influence of 
external fields. For the present, we restrict ourselves to perturbations constant
in time and a Hamiltonian $\hat{H}_0$, the spectrum of which is discrete and not
degenerate.


11.1 Stationary Perturbation Theory 


We assume that the Hamiltonian is split up according to (11.1), and that the
eigenvalues and eigenfunctions of the unperturbed Hamiltonian $\hat{H}_0$ are known:

$$\hat{H}_0\psi_n^0 = E_n^0\psi_n^0 . \tag{11.2}$$

We are seeking the eigenvalues and eigenfunctions of the complete Hamiltonian
$\hat{H}$, i.e.

$$\hat{H}\psi = E\psi , \tag{11.3}$$

$$(\hat{H}_0 + \varepsilon\hat{W})\psi = E\psi . \tag{11.4}$$

The desired exact wave function $\psi$ is expanded in terms of the known solutions
$\psi_n^0$ of the unperturbed system:

$$\psi(\boldsymbol{r}) = \sum_n a_n\,\psi_n^0(\boldsymbol{r}) . \tag{11.5}$$

Fig. 11.1. Effect of perturbation: for $\varepsilon \neq 0$ other states $\psi_n^0$ are mixed with
amplitudes $a_n$ to the unperturbed state $\psi_k^0$. The latter is fully retained for $\varepsilon = 0$.
The states in the vicinity of $\psi_k^0$ are more strongly admixed than those further away

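Inserting this into (11.4) and using (11.2) yields

$$\sum_n a_n\,(E_n^0 - E + \varepsilon\hat{W})\,\psi_n^0 = 0 .$$

Multiplication by $\psi_m^{0*}$ and subsequent integration gives

$$\sum_n a_n\big[(E_n^0 - E)\,\delta_{mn} + \varepsilon W_{mn}\big] = 0 . \tag{11.6}$$

We have used the fact that the eigenfunctions are orthonormal:

$$\int \psi_m^{0*}\,\psi_n^0\,dV = \delta_{mn} .$$

The matrix element $W_{mn}$ stands for

$$W_{mn} = \int \psi_m^{0*}\,\hat{W}\,\psi_n^0\,dV . \tag{11.7}$$

Equation (11.6) can be transformed into

$$a_m\,(E_m^0 - E + \varepsilon W_{mm}) + \varepsilon\sum_{n \neq m} a_n W_{mn} = 0 . \tag{11.6a}$$

For $\varepsilon = 0$, we have only the idealized state, with $a_m = 1$ and $E = E_m^0$, so that,
according to (11.5), $\psi = \psi_m^0$. Now if $\varepsilon \neq 0$, the wave function will change and
other neighbouring states $\psi_n^0$ with $n \neq m$ will be admixed (see Fig. 11.1).

To calculate this, we use the fact that the perturbation is small. We expand
both the desired expansion coefficients $a_m$ and the energy eigenvalues $E_k$ in
powers of the perturbation parameter $\varepsilon$:

$$a_m = a_m^{(0)} + \varepsilon a_m^{(1)} + \varepsilon^2 a_m^{(2)} + \cdots ,$$

$$E = E_k = E^{(0)} + \varepsilon E^{(1)} + \varepsilon^2 E^{(2)} + \cdots . \tag{11.8}$$

The numbers in the brackets show us the degree of the approximation,
e.g. $a_m^{(2)}$ means that this coefficient is small in second order in $\varepsilon$. We now insert
the series (11.8) into (11.6a) and order in powers of $\varepsilon$:

$$\begin{aligned}
&(E_m^0 - E^{(0)})\,a_m^{(0)} + \varepsilon\Big[(W_{mm} - E^{(1)})\,a_m^{(0)} + (E_m^0 - E^{(0)})\,a_m^{(1)} + \sum_{n \neq m} W_{mn}\,a_n^{(0)}\Big] \\
&\quad + \varepsilon^2\Big[(W_{mm} - E^{(1)})\,a_m^{(1)} + (E_m^0 - E^{(0)})\,a_m^{(2)} - E^{(2)}a_m^{(0)} + \sum_{n \neq m} W_{mn}\,a_n^{(1)}\Big] \\
&\quad + \varepsilon^3[\ldots] + \cdots = 0 .
\end{aligned} \tag{11.9}$$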


From this formula we can determine the energy values and the expansion 
coefficients in the various orders of approximation, which we shall now study 
systematically. 


0th Approximation
If we set $\varepsilon = 0$, there is no perturbation and (11.9) yields

$$(E_m^0 - E^{(0)})\,a_m^{(0)} = 0 ; \tag{11.10}$$

m runs over all levels, m = 1, 2, 3, ... . Let us focus on the level m = k and look
for the change of its energy and wave function. Equation (11.10) then yields

$$E^{(0)} = E_k^0 , \qquad a_m^{(0)} = \delta_{mk} . \tag{11.11}$$


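1st Approximation


Inserting these values into (11.9) and taking into consideration only terms up to
the first order in $\varepsilon$, we have

$$(E_m^0 - E_k^0)\,\delta_{mk} + \varepsilon\Big[(W_{mm} - E^{(1)})\,\delta_{mk} + (E_m^0 - E_k^0)\,a_m^{(1)} + \sum_{n \neq m} W_{mn}\,\delta_{nk}\Big] = 0 . \tag{11.12}$$

The first term does not contribute at all because of the solution of the 0th
approximation. For m = k, we get the energy shift of the k level in a first
approximation as

$$E^{(1)} = W_{kk} . \tag{11.13}$$

The admixture amplitudes for the other states follow from (11.12) for $m \neq k$:

$$(E_m^0 - E_k^0)\,a_m^{(1)} + W_{mk} = 0 \quad\text{and}\quad a_m^{(1)} = \frac{W_{mk}}{E_k^0 - E_m^0} , \quad m \neq k . \tag{11.14}$$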


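In the case of m = k, we obviously do not obtain a condition for the $a_k^{(1)}$ from
(11.12). Hence, we have to determine $a_k^{(1)}$ in a different manner, namely by the
normalization of the wave function $\psi_k$. Indeed for $\psi_k$, we get, according to (11.8),
in first-order perturbation theory

$$\begin{aligned}
\psi_k = \sum_n a_n\,\psi_n^0 &= \sum_n \big(a_n^{(0)} + \varepsilon a_n^{(1)}\big)\psi_n^0 = \psi_k^0 + \varepsilon a_k^{(1)}\psi_k^0 + \varepsilon\sum_{n \neq k} a_n^{(1)}\psi_n^0 \\
&= \psi_k^0 + \varepsilon\Big(a_k^{(1)}\psi_k^0 + \sum_{n \neq k} a_n^{(1)}\psi_n^0\Big) .
\end{aligned} \tag{11.15}$$

Since the $\psi_k$ should also span an orthonormal system of wave functions, we get

$$\begin{aligned}
\langle\psi_k|\psi_k\rangle = 1 &= \langle\psi_k^0|\psi_k^0\rangle + \langle\psi_k^0|\varepsilon a_k^{(1)}\psi_k^0\rangle + \langle\varepsilon a_k^{(1)}\psi_k^0|\psi_k^0\rangle + \varepsilon^2\sum_{n,n'}\langle a_n^{(1)}\psi_n^0|a_{n'}^{(1)}\psi_{n'}^0\rangle \\
&= 1 + \varepsilon\big(a_k^{(1)} + a_k^{(1)*}\big) + \varepsilon^2\sum_n |a_n^{(1)}|^2 .
\end{aligned} \tag{11.16}$$

Neglecting the term proportional to $\varepsilon^2$ (because we are calculating up to first
order only), we have

$$0 = \varepsilon\big(a_k^{(1)} + a_k^{(1)*}\big) . \tag{11.17}$$

As the wave function is determined only up to a phase factor, we can choose the
$a_k^{(1)}$ to be real. Then, obviously, $a_k^{(1)} = 0$ results.


2nd Approximation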


If we insert the values of the first approximation into (11.9) for m = k, only parts
of the coefficient of $\varepsilon^2$ remain and for the energy it follows that

$$E^{(2)} = \sum_{n \neq k}\frac{W_{kn}W_{nk}}{E_k^0 - E_n^0} , \tag{11.18}$$

and analogously for the amplitudes with $m \neq k$,

$$a_m^{(2)} = \sum_{n \neq k}\frac{W_{mn}W_{nk}}{(E_k^0 - E_n^0)(E_k^0 - E_m^0)} - \frac{W_{kk}W_{mk}}{(E_k^0 - E_m^0)^2} . \tag{11.19}$$

Again, we do not obtain a condition for $a_k^{(2)}$ in the case m = k, so we have to use
the normalization condition of the wave function once more. This procedure can
be continued so that the perturbation effects can be determined in any degree of
approximation. According to (11.8) we get for the energy of state k in the 2nd
approximation

$$E_k = E_k^0 + \varepsilon W_{kk} - \varepsilon^2\sum_{n \neq k}\frac{W_{kn}W_{nk}}{E_n^0 - E_k^0} . \tag{11.20}$$

This contains the interesting result that, in first order, the correction of the
energy is simply the expectation value of the perturbation $\hat{W}$, which is quite
reasonable. If k denotes the ground state of a system, $E_k^0 < E_n^0$, and the effect of
a second-order approximation is always negative, regardless of the sign of the
perturbation, because

$$W_{kn}W_{nk} = |W_{nk}|^2 \quad\text{and}\quad E_n^0 - E_k^0$$

are always positive. This is an important fact, which we can use in many
problems, particularly in those cases in which the first-order correction
$W_{kk}$ vanishes for one reason or another.

For the application of perturbation theory, we assumed the perturbation to
be small, i.e. the energy levels and their differences are not changed significantly.
We can express this in the following way:

$$\Big|\frac{\varepsilon W_{mn}}{E_m^0 - E_n^0}\Big| \ll 1 \quad\text{for}\quad m \neq n . \tag{11.21}$$

Since the energies $E_m^0$ and $E_n^0$ are very close to each other for large quantum
numbers in the Coulomb field [see (9.43)], perturbation theory can only be
applied to the case of strongly bound states. We have required during the
derivation of the perturbation formulas that the nature of the spectra not be
changed qualitatively. The perturbed states $\psi_k$ should continuously emerge from
the unperturbed states $\psi_k^0$ if the perturbation $\hat{W}$ is turned on.
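
The formulas (11.13), (11.14) and (11.18) can be compared with an exact diagonalization for a small matrix model. In the sketch below the unperturbed levels `E0`, the random symmetric perturbation matrix `W` and the value of `eps` are illustrative assumptions; the level k = 0 is the ground state, so the second-order shift should come out negative.

```python
import numpy as np

rng = np.random.default_rng(4)
N, eps, k = 6, 0.01, 0                    # basis size, perturbation parameter, level index

E0 = np.arange(1.0, N + 1.0)              # nondegenerate unperturbed levels E_n^0 (illustrative)
X = rng.normal(size=(N, N))
W = 0.5 * (X + X.T)                       # real symmetric perturbation matrix W_mn

exact = np.linalg.eigvalsh(np.diag(E0) + eps * W)[k]   # exact energy of the perturbed level k

others = np.arange(N) != k
E1 = W[k, k]                                                      # (11.13)
E2 = np.sum(W[k, others] * W[others, k] / (E0[k] - E0[others]))   # (11.18)
second_order = E0[k] + eps * E1 + eps**2 * E2                     # (11.20)

# First-order admixture amplitudes a_m^(1) = W_mk / (E_k^0 - E_m^0), m != k, cf. (11.14)
a1 = np.zeros(N)
a1[others] = W[others, k] / (E0[k] - E0[others])
print(exact, second_order, E2)
```

The remaining difference between `exact` and `second_order` is of third order in `eps`, provided condition (11.21) is satisfied.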


11.2 Degeneracy 


Now we shall briefly discuss the application of perturbation theory to
a spectrum with degenerate states. Up to now we have talked about states without any
degeneracy; indeed, for any energy $E_i$, we have assumed that only one definite
state $\psi_i$ exists; in a system in which degeneracy occurs, this is no longer the case.
For a given level of energy $E_n^0$, a series of eigenfunctions $\psi_{n\beta}^0$, $\beta = 1, 2, \ldots, f_n$,
might exist. (Here, $\beta$ stands for one or more quantum numbers.) The energy
eigenvalues are independent of $\beta$. Such a level is called $f_n$-fold degenerate.

If we go back to (11.6a), we will now have to write it in the following form:

$$a_{m\alpha}\,(E_m^0 + \varepsilon W_{m\alpha m\alpha} - E) + \varepsilon\sum_{n\beta \neq m\alpha} a_{n\beta}\,W_{m\alpha n\beta} = 0 , \tag{11.22}$$

where, according to (11.7), the matrix elements are given by

$$W_{m\alpha n\beta} = \int \psi_{m\alpha}^{0*}\,\hat{W}\,\psi_{n\beta}^0\,dV . \tag{11.23}$$

The energy eigenvalue $E_m^0$ of the unperturbed state contains no additional index.
It is independent of $\alpha$ because of degeneracy; this is precisely the peculiarity of
degeneracy.

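If we look at the 0th approximation, we can see the effect of perturbation on
the degenerate state quite clearly. From (11.10), we get for the 0th
approximation for the level m = k

$$(E_k^0 - E^{(0)})\,a_{k\alpha}^{(0)} = 0 . \tag{11.24}$$

Obviously,

$$E^{(0)} = E_k^0 \quad\text{and}\quad a_{k\alpha}^{(0)} = a_{k\alpha} \neq 0 \quad\text{for}\quad \alpha = 1, \ldots, f_k ,$$

$$\text{and}\quad a_{m\alpha}^{(0)} = 0 \quad\text{for}\quad m \neq k .$$

The double sum over n and $\beta$ reduces to a single sum over $\beta$ for the 0th
approximation (because n = k only) and we get for the kth level

$$(E_k^0 + \varepsilon W_{k\alpha k\alpha} - E)\,a_{k\alpha}^{(0)} + \varepsilon\sum_{\beta \neq \alpha} a_{k\beta}^{(0)}\,W_{k\alpha k\beta} = 0 . \tag{11.25}$$

The index $\alpha$ runs from 1 to $f_k$. Therefore (11.25) represents a system of $f_k$ linear
equations for the $a_{k\alpha}^{(0)}$. The determinant of the system is of dimension $f_k$. It has
to vanish if the linear system of equations is to give nontrivial solutions $a_{k\alpha}^{(0)}$,
i.e. solutions unequal to zero. Hence,

$$\begin{vmatrix} E^0 + \varepsilon W_{11} - E & \varepsilon W_{12} & \cdots & \varepsilon W_{1f_k} \\ \varepsilon W_{21} & E^0 + \varepsilon W_{22} - E & \cdots & \varepsilon W_{2f_k} \\ \vdots & \vdots & \ddots & \vdots \\ \varepsilon W_{f_k 1} & \varepsilon W_{f_k 2} & \cdots & E^0 + \varepsilon W_{f_k f_k} - E \end{vmatrix} = 0 . \tag{11.26}$$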


We have deleted the index k in the determinant because it always appears in the
same way. Equation (11.26) is called a secular equation. It is an equation of
degree $f_k$ for the determination of the energy E and it thus has in general
$f_k$ solutions $E_{k\alpha}$ for E. As the perturbation $\varepsilon\hat{W}$ is small, the solutions are close
together. In general the degeneracy of a level is lifted under the influence of
perturbation and the formerly $f_k$-fold degenerate state splits up energetically into
$f_k$ close-lying states with energies $E_{k\alpha}$, $\alpha = 1, \ldots, f_k$.

The appearance of a degeneracy can always be traced back to a symmetry of
the system. For example, the (2l + 1)-fold angular-momentum degeneracy of the
state of a particle in the central potential (see Exercise 7.2 and Chap. 9) is a result
of the spherical symmetry (isotropy of space) of the potential. If the symmetry
is broken by a perturbation (broken symmetry), the degenerate levels split into
a series of neighbouring levels. Such a perturbation may be caused by an
additional weak interaction (e.g. spin-orbit coupling causes so-called
fine-structure splitting) or by applying an external field.

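The eigenfunctions $\varphi_{k\alpha}$ of the energies $E_{k\alpha}$ are special linear combinations of the degenerate states $\psi^0_{k\beta}$. The corresponding amplitudes $a^0_{k\alpha\beta}$ are obtained by insertion of the solutions $E = E_{k\alpha}$ into (11.25), which can then be solved for $a^0_{k\alpha\beta}$. The resulting eigenfunctions are then of the form

$$\varphi_{k\alpha} = \sum_{\beta=1}^{f_k} a^0_{k\alpha\beta}\,\psi^0_{k\beta} \,. \tag{11.27}$$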


Since we are now again dealing with nondegenerate levels, the further approxi- 
mation can be obtained in perturbation theory as before. 
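The procedure of (11.25) to (11.27) amounts to diagonalizing the perturbation inside the degenerate subspace. The following is a minimal numerical sketch of that step, not a prescription from the text: the function name `split_degenerate_level`, the sample matrix `W` and the value of `eps` are illustrative assumptions.

```python
import numpy as np

def split_degenerate_level(E0, W, eps):
    """First-order splitting of an f-fold degenerate level.

    E0  : unperturbed energy of the degenerate level
    W   : (f, f) Hermitian matrix W[alpha, beta] of the perturbation,
          taken between the degenerate unperturbed states
    eps : small strength parameter multiplying W
    """
    W = np.asarray(W, dtype=complex)
    # The roots of the secular determinant det(E0 + eps*W - E) = 0 are the
    # eigenvalues of E0*1 + eps*W. eigh also returns the amplitudes: column
    # alpha holds the coefficients of the alpha-th new eigenfunction.
    shifts, amplitudes = np.linalg.eigh(eps * W)
    return E0 + shifts, amplitudes

# Hypothetical 3-fold degenerate level with an assumed perturbation matrix.
W = np.array([[0.0, 1.0, 0.0],
              [1.0, 0.0, 0.0],
              [0.0, 0.0, 0.5]])
energies, amplitudes = split_degenerate_level(E0=-1.0, W=W, eps=0.01)
print(energies)
```

Each column of `amplitudes` is normalized, so the new eigenfunctions are orthonormal combinations of the degenerate states, as required for continuing with nondegenerate perturbation theory.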


EXAMPLE


11.1 The Stark Effect 


As an example of perturbation theory, we now calculate the level splitting of 
a hydrogen atom in a homogeneous electric field. As we shall see, the effect of the 
electric field on an atom is to split the spectral lines. This phenomenon was 
experimentally shown by Stark in 1913. 

Experiments showed that the effect of an electric field on hydrogen or other 
atoms depends on the field strength; this is, of course, to be expected. But the 
effect is different for hydrogen than for other atoms. The energy levels of 
hydrogen (e.g. the Balmer series) for weak fields split proportionally to the first 
power of the field strength (the so-called linear Stark effect). The splitting of the 
energy levels of all other atoms is proportional to the second power in the field 
strength (quadratic Stark effect). 

There was no explanation for the Stark$^1$ effect in classical theory; only
quantum mechanics indicated how to understand this phenomenon. We shall 
now discuss the theory of the linear Stark effect in greater detail, restricting 
ourselves to the second level (n = 2) of hydrogen. 

The applied external electric field $\boldsymbol{E}$ (in the experiments it was $10^4$ to $10^5$ V/cm) is much smaller than the inner atomic one, which is caused by the nucleus and is of the order $E_{\mathrm{nucl}} = e/a_0^2 \approx 5 \times 10^9$ V/cm ($a_0$ is the radius of the first Bohr orbit). To solve the problem we use perturbation theory of the degenerate case. The potential energy of the electron in the external electric field, $V$, is treated as the perturbation.

$^1$ A general discussion of symmetry problems in quantum mechanics can be found in W. Greiner, B. Müller: Quantum Mechanics – Symmetries, 2nd rev. ed. (Springer, Berlin, Heidelberg, New York 1994)

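The first level $\psi_{100}$ in the hydrogen atom is not degenerate. Therefore, in the simplest case, we start from the splitting of the second level. As we know, the hydrogen levels are $n^2$-fold degenerate; i.e. four eigenfunctions belong to the energy $E^0 = E_2^0$ of the unperturbed hydrogen atom. These wave functions are (see Table 9.1)

$$\varphi_1 = \psi_{200} = \frac{1}{\sqrt{2a_0^3}}\,e^{-r/2a_0}\left(1 - \frac{r}{2a_0}\right)Y_{00} \,, \quad \text{(2s state)} \tag{1a}$$

$$\varphi_2 = \psi_{210} = \frac{1}{\sqrt{24a_0^3}}\,\frac{r}{a_0}\,e^{-r/2a_0}\,Y_{10} \,, \quad \text{(2p states)} \tag{1b}$$

$$\varphi_{3,4} = \psi_{21\pm1} = \frac{1}{\sqrt{24a_0^3}}\,\frac{r}{a_0}\,e^{-r/2a_0}\,Y_{1\pm1} \,. \tag{1c}$$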


The four-fold degeneracy is lifted by the appearance of an electric field. Now there is an additional potential energy for the electron in the homogeneous field $\boldsymbol{E}$ because of the adjustment of the dipole moment $e\boldsymbol{r}$ of the electron in the field. If we arrange the electric field in the $z$ direction, the potential energy will be given by

$$V = -e|\boldsymbol{E}|\,z = -e|\boldsymbol{E}|\,r\cos\vartheta = -e|\boldsymbol{E}|\,r\sqrt{\frac{4\pi}{3}}\,Y_{10} \,. \tag{2}$$


Let $\hat{H}_0$ denote the Hamiltonian of the unperturbed system. The perturbed Hamiltonian will then be

$$\hat{H} = \hat{H}_0 + V \,.$$

We calculate the matrix elements of the perturbation according to (11.26) and use the functions $\varphi_\alpha$ introduced in (1). The matrix elements are of the form

$$V_{\alpha\beta} = \int \varphi_\alpha^*\,V\,\varphi_\beta\,\mathrm{d}V \,.$$

Most of the integrands are odd functions of the space coordinates; this can immediately be seen upon insertion of the perturbation (2) and functions $\varphi_\alpha$ of (1). After integration, only the matrix elements $V_{12}$ and $V_{21}$ turn out to be nonvanishing. In this case,


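$$\begin{aligned}
V_{12} = V_{21} &= \int \psi_{200}^*\,(-e|\boldsymbol{E}|)\,r\sqrt{\frac{4\pi}{3}}\,Y_{10}\,\psi_{210}\,\mathrm{d}V \\
&= -\frac{e|\boldsymbol{E}|}{\sqrt{2a_0^3}\,\sqrt{24a_0^3}}\,\sqrt{\frac{4\pi}{3}}\,\frac{1}{\sqrt{4\pi}} \int_0^\infty \left(1 - \frac{r}{2a_0}\right)\frac{r}{a_0}\,e^{-r/a_0}\,r^3\,\mathrm{d}r \int \mathrm{d}\Omega\,|Y_{10}(\vartheta, \varphi)|^2 \\
&= -\frac{e|\boldsymbol{E}|}{12a_0^4} \int_0^\infty \left(r^4 - \frac{r^5}{2a_0}\right)e^{-r/a_0}\,\mathrm{d}r = 3e|\boldsymbol{E}|a_0 \,.
\end{aligned} \tag{3}$$

[Figure caption, Example 11.1: The four degenerate hydrogen levels belonging to the main quantum number $n = 2$ are split up by the Stark effect into three levels. The middle energy, which corresponds to the unperturbed energy $E_2^0$, is still twice degenerate.]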


Here, we have used the fact that $Y_{00} = (4\pi)^{-1/2}$. Because of the degeneracy of the system, the general solution of the Hamiltonian to the energy eigenvalue $E_2^0$ is given by a linear combination of the functions $\varphi_\alpha$:

$$\psi = \sum_{\alpha=1}^{4} a_\alpha\,\varphi_\alpha \,. \tag{4}$$


To determine the coefficients $a_\alpha$, we use the system of equations (11.25), which in this case reads

$$(E_2^0 + V_{\alpha\alpha} - E)\,a_\alpha + \sum_{\beta \neq \alpha} a_\beta\,V_{\alpha\beta} = 0 \,. \tag{5}$$


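Since all matrix elements vanish except for $V_{12}$ and $V_{21}$, the system reduces to four equations, namely

$$(E_2^0 - E)\,a_1 + V_{12}\,a_2 = 0 \,, \tag{6a}$$
$$V_{21}\,a_1 + (E_2^0 - E)\,a_2 = 0 \,, \tag{6b}$$
$$(E_2^0 - E)\,a_3 = 0 \,, \tag{6c}$$
$$(E_2^0 - E)\,a_4 = 0 \,. \tag{6d}$$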


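To get a nontrivial solution the coefficient determinant has to vanish, i.e.

$$\begin{vmatrix}
E_2^0 - E & V_{12} & 0 & 0 \\
V_{21} & E_2^0 - E & 0 & 0 \\
0 & 0 & E_2^0 - E & 0 \\
0 & 0 & 0 & E_2^0 - E
\end{vmatrix} = 0 \,.$$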


From this we easily obtain the four values for the energies of the perturbed levels, which are, as we know, solutions of this determinant. The result is

$$E_1 = E_2 = E_2^0 \,, \qquad E_3 = E_2^0 + V_{12} \,, \qquad E_4 = E_2^0 - V_{12} \,. \tag{7}$$


Obviously, the superposition of the homogeneous field did not completely lift the degeneracy. This can be explained by the fact that we now have cylindrical symmetry instead of the spherical symmetry from before. In other words, the symmetry is not completely broken. The resulting splitting is shown in the following figure.
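The matrix element (3) and the splitting (7) can be checked numerically. The sketch below is only an illustration under stated assumptions: atomic units with $a_0 = 1$, an arbitrarily chosen field strength `F` standing for $e|\boldsymbol{E}|$, and a plain Riemann sum for the radial integral.

```python
import numpy as np

a0 = 1.0     # Bohr radius (atomic units, assumed)
F = 1e-4     # e|E| in atomic units (illustrative value, not from the text)

# Radial parts of psi_200 and psi_210 as given in (1a) and (1b).
r = np.linspace(0.0, 60.0, 600001)
dr = r[1] - r[0]
R20 = (1 - r / (2 * a0)) * np.exp(-r / (2 * a0)) / np.sqrt(2 * a0**3)
R21 = (r / a0) * np.exp(-r / (2 * a0)) / np.sqrt(24 * a0**3)

# Radial integral of R20 * r * R21 with the volume factor r^2.
radial = np.sum(R20 * R21 * r**3) * dr

# Angular factor: sqrt(4 pi / 3) * integral(Y00 Y10 Y10 dOmega) = 1 / sqrt(3).
V12 = -F * radial / np.sqrt(3.0)

# Perturbation matrix in the basis (phi_1, phi_2, phi_3, phi_4).
V = np.zeros((4, 4))
V[0, 1] = V[1, 0] = V12
shifts = np.linalg.eigvalsh(V)   # energy shifts relative to E_2^0

print(V12 / (F * a0))   # should be close to 3
print(shifts / F)       # should be close to [-3, 0, 0, 3]
```

Two of the four shifts stay at zero, which is the remaining twofold degeneracy discussed above.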


[Figure: energy versus field strength $|\boldsymbol{E}|$; the two outer levels are displaced from $E_2^0$ by $\pm 3ea_0|\boldsymbol{E}|$.]

The linear level splitting by the electric field appears only in hydrogen. It results from the linearity of $V_{12}$ in $|\boldsymbol{E}|$ [see (3)] and because of (7).

There is no $l$ degeneracy in many-electron systems; therefore, no average dipole moment exists, but the atom is polarized by the external field. The so-induced dipole moment is proportional to the field strength; therefore the energies of the atom change with $|\boldsymbol{E}|^2$. This phenomenon is called the quadratic Stark effect.

To get the wave functions (4) corresponding to the energy values (7), we insert the energies into the system (6):

For $E_1 = E_2 = E_2^0$, it follows that $a_1 = a_2 = 0$, with $a_3$ and $a_4$ arbitrary.
For $E_3 = E_2^0 + V_{12}$, it follows that $a_1 = a_2$, $a_3 = a_4 = 0$.
For $E_4 = E_2^0 - V_{12}$, we get $a_1 = -a_2$, $a_3 = a_4 = 0$.


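With the applied field $|\boldsymbol{E}|$ we thus obtain the following wave functions:

$$\begin{aligned}
\text{for } E = E_2^0: \quad & \psi_{\mathrm{I,II}} = a_3\varphi_3 + a_4\varphi_4 = a_3\psi_{211} + a_4\psi_{21-1} \,, \quad \text{with } |a_3|^2 + |a_4|^2 = 1 \,; \\
\text{for } E = E_2^0 + V_{12}: \quad & \psi_{\mathrm{III}} = \frac{1}{\sqrt{2}}(\varphi_1 + \varphi_2) = \frac{1}{\sqrt{2}}(\psi_{200} + \psi_{210}) \,; \\
\text{for } E = E_2^0 - V_{12}: \quad & \psi_{\mathrm{IV}} = \frac{1}{\sqrt{2}}(\varphi_1 - \varphi_2) = \frac{1}{\sqrt{2}}(\psi_{200} - \psi_{210}) \,.
\end{aligned} \tag{8}$$

It is easy to show that the perturbation matrix built with the functions $\psi_{\mathrm{I}}, \psi_{\mathrm{II}}, \psi_{\mathrm{III}}, \psi_{\mathrm{IV}}$ is diagonal.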
The Stark effect with $n = 2$ is qualitatively interpreted in the following way. As the wave function of the electron is not spherically symmetric (because of the degeneracy of the 2s and 2p states), the atom has an electric dipole moment $\boldsymbol{p}$. For this reason, an atom in the electric field

$$\boldsymbol{E} = \{E_x = 0,\ E_y = 0,\ E_z = |\boldsymbol{E}|\}$$

gains the additional energy

$$V = -(\boldsymbol{p} \cdot \boldsymbol{E}) = -|\boldsymbol{p}||\boldsymbol{E}|\cos\gamma \,, \tag{9}$$

where $\gamma$ is the angle between the direction of the electric dipole of the atom and the $z$ direction.

If we compare this expression with (3) and (7), we see that the electric dipole moment of the atom is $|\boldsymbol{p}| = 3a_0e$, where the solution $\psi_{\mathrm{IV}}$ (energy $E_2^0 - V_{12}$) corresponds to $\gamma = 0$, but $\psi_{\mathrm{III}}$ (energy $E_2^0 + V_{12}$) corresponds to $\gamma = \pi$. For the two remaining solutions, built from $\varphi_3$ and $\varphi_4$, we then have $\gamma = \pm\pi/2$. The latter result is due to an electric dipole moment which is perpendicular to the electric field; for this reason, no additional energy is produced. In other words, the linear Stark effect in a hydrogen atom with $n = 2$ is caused by the characteristic electric dipole moment $\boldsymbol{p}$.

The results of these calculations, obtained by the use of quantum mechanics, agree with the experiments only for weak fields ($|\boldsymbol{E}| \sim 10^3$ V/cm). For strong fields ($|\boldsymbol{E}| > 10^4$ V/cm), additional splitting occurs (the quadratic Stark effect), caused by the fact that the degeneracy in the angular-momentum quantum number is broken. The Stark effect vanishes totally if the field strength is greater than $10^5$ V/cm. This phenomenon is connected with the self-ionization of atoms in the electric field: electrons in an excited level lose their binding to the atom if strong homogeneous electric fields are superposed.


EXERCISE


11.2 Comparison of a Result of Perturbation Theory 
with an Exact Result 


Consider a hydrogen-like atom with a central nuclear charge number $Z$, containing one 1s electron.


Problem. Calculate the change in energy by increasing the charge of the nucleus by one ($Z \to Z + 1$). Use first-order perturbation theory and compare this with the exact result.


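Solution. The wave function for the 1s electron reads

$$\psi_{1s} = \frac{1}{\sqrt{\pi}}\,\gamma^{3/2}\,e^{-\gamma r} \,, \qquad \gamma = \frac{Zme^2}{\hbar^2} \,. \tag{1}$$

The unperturbed energy is

$$E_{1s} = -\frac{1}{2}\,\frac{me^4}{\hbar^2}\,Z^2 \,. \tag{2}$$

In first-order perturbation theory, the energy change is given by

$$\Delta E_{1s} = \langle\psi_{1s}|\Delta\hat{H}|\psi_{1s}\rangle \,, \tag{3}$$

i.e. by the expectation value of the perturbation operator $\Delta\hat{H}$. In our case, we have

$$\Delta\hat{H} = \Delta V = -\frac{e^2}{r} \,. \tag{4}$$

This yields

$$\begin{aligned}
\Delta E_{1s} &= \frac{1}{\pi}\,\gamma^3\,4\pi \int_0^\infty r^2\,\mathrm{d}r\,e^{-2\gamma r}\left(-\frac{e^2}{r}\right) \\
&= -4\gamma^3e^2 \int_0^\infty \mathrm{d}r\,r\,e^{-2\gamma r} = -4\gamma^3e^2\,\frac{1}{4\gamma^2} \\
&= -\gamma e^2 = -Z\,\frac{me^4}{\hbar^2} \,.
\end{aligned} \tag{5}$$

In comparison with this, the exact result is

$$\Delta E_{1s} = E_{1s}(Z + 1) - E_{1s}(Z) = -\frac{1}{2}\,\frac{me^4}{\hbar^2}\left[(Z + 1)^2 - Z^2\right] = -\frac{me^4}{\hbar^2}\left(Z + \frac{1}{2}\right) \,.$$

The first-order perturbation theory approximation is obviously quite good for large $Z$, as would be expected.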
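How quickly the first-order result approaches the exact one can be made explicit with a few lines of arithmetic. The sketch below works in units of $me^4/\hbar^2$; the function names are illustrative choices, not notation from the text.

```python
def first_order_shift(Z):
    """First-order energy change for Z -> Z + 1, eq. (5), in units of m e^4 / hbar^2."""
    return -float(Z)

def exact_shift(Z):
    """Exact difference E(Z + 1) - E(Z) in the same units."""
    return -0.5 * ((Z + 1) ** 2 - Z ** 2)

def relative_error(Z):
    """Relative deviation of the first-order result from the exact one."""
    return (first_order_shift(Z) - exact_shift(Z)) / exact_shift(Z)

for Z in (1, 2, 10, 50):
    print(Z, first_order_shift(Z), exact_shift(Z), relative_error(Z))
```

The relative error equals $-1/(2Z + 1)$, so it is about one third for hydrogen ($Z = 1$) and falls below one percent only for $Z \geq 50$.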
11.3 Two-State Level Crossing


Let the Hamiltonian $\hat{H}_0$ have two closely neighbouring levels with energies $E_1^{(0)} \approx E_2^{(0)}$ and eigenfunctions $\psi_1^{(0)}$, $\psi_2^{(0)}$. Let all other eigenvalues be very different from them, so that the two levels can be considered as rather isolated energetically.


Problem. Investigate the Hamiltonian $\hat{H} = \hat{H}_0 + V$ and

a) Show by use of perturbation theory that on the one hand, only these levels contribute to the correction of the energy eigenvalues and, on the other hand, that one cannot use perturbation theory any longer. What must $\psi_1^{(0)}$ and $\psi_2^{(0)}$ be like?

b) Show that perturbation theory can be improved by diagonalizing the two-level problem, which gives

$$E_{1,2} = \tfrac{1}{2}(H_{11} + H_{22}) \pm \tfrac{1}{2}\sqrt{(H_{11} - H_{22})^2 + 4|H_{12}|^2}$$

with

$$H_{ij} = \int \psi_i^{(0)*}\,\hat{H}\,\psi_j^{(0)}\,\mathrm{d}x \,.$$

c) Plot $E_{1,2}$ as a function of $\Delta = H_{11} - H_{22}$. What does $\delta E = E_1 - E_2$ look like if, in the first approximation, a level crossing occurs, i.e. if the potential $V$ is such that $H_{11} = H_{22}$?


Solution. a) The first correction of the energy eigenvalues is

$$E_i^{(1)} = E_i^{(0)} + V_{ii} = H_{ii} \tag{1}$$

and, in second order,

$$E_i^{(2)} = H_{ii} - \sum_{j \neq i} \frac{|V_{ij}|^2}{E_j^{(0)} - E_i^{(0)}} \,. \tag{2}$$

For the two levels we have $E_1^{(0)} \approx E_2^{(0)}$, i.e. the denominator is going to be small and we can neglect all other terms if $V_{12} \neq 0$ (this means that for a radially symmetric potential, the wave functions $\psi_1$ and $\psi_2$ must have the same angular-momentum quantum numbers). In this case we can neglect all states except 1 and 2. Nevertheless, because of the small energy denominators in (2) certain terms in all orders of perturbation theory are going to be very large and perturbation theory loses its meaning.
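b) Let the eigenfunctions of $\hat{H}_0$ be $\psi_1^{(0)}$, $\psi_2^{(0)}$ with

$$\hat{H}_0\,\psi_i^{(0)} = E_i^{(0)}\,\psi_i^{(0)} \,, \qquad i = 1, 2 \,. \tag{3}$$

We diagonalize the total Hamiltonian $\hat{H} = \hat{H}_0 + V$ within this two-state basis, i.e. we search for

$$\hat{H}\psi = (\hat{H}_0 + V)\psi = E\psi \,, \tag{4}$$

with

$$\psi = a\,\psi_1^{(0)} + b\,\psi_2^{(0)} \,. \tag{5}$$

Multiplying from the left by $\psi_1^{(0)*}$ and $\psi_2^{(0)*}$, and integrating over $x$, we get the following system of linear equations for $a$ and $b$:

$$\begin{aligned}
(H_{11} - E)\,a + V_{12}\,b &= 0 \,, \\
V_{21}\,a + (H_{22} - E)\,b &= 0 \,,
\end{aligned} \tag{6}$$

which has a solution $(a, b) \neq (0, 0)$ if

$$\begin{vmatrix} H_{11} - E & V_{12} \\ V_{21} & H_{22} - E \end{vmatrix} = (H_{11} - E)(H_{22} - E) - |V_{12}|^2 = E^2 - (H_{11} + H_{22})E + H_{11}H_{22} - |V_{12}|^2 = 0 \tag{7}$$

or

$$E_{1,2} = \tfrac{1}{2}(H_{11} + H_{22}) \pm \tfrac{1}{2}\sqrt{(H_{11} - H_{22})^2 + 4|V_{12}|^2} \tag{8}$$

is valid. If we can use ordinary perturbation theory, i.e. if

$$|E_1^{(0)} - E_2^{(0)}| \gg |V_{12}| \quad\text{and}\quad |E_1^{(0)} - E_2^{(0)}| \gg |V_{11} - V_{22}| \tag{9}$$

is valid, we get from (8) the energies

$$\begin{aligned}
E_1 &= E_1^{(0)} + V_{11} + \frac{|V_{12}|^2}{E_1^{(0)} - E_2^{(0)}} \,, \\
E_2 &= E_2^{(0)} + V_{22} + \frac{|V_{12}|^2}{E_2^{(0)} - E_1^{(0)}} \,.
\end{aligned} \tag{10}$$

This is exactly the result of second-order perturbation theory.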
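A quick numerical comparison shows how (10) follows from (8) when the conditions (9) hold. The sketch below uses arbitrary illustrative numbers for $E_i^{(0)}$ and $V_{ij}$; the function names are assumptions made for this example.

```python
import numpy as np

def exact_levels(H11, H22, V12):
    """Eigenvalues (8) of the two-level problem, upper root first."""
    mean = 0.5 * (H11 + H22)
    root = 0.5 * np.sqrt((H11 - H22) ** 2 + 4 * abs(V12) ** 2)
    return mean + root, mean - root

def second_order_levels(E1, E2, V11, V22, V12):
    """Second-order perturbation result (10) for levels 1 and 2."""
    return (E1 + V11 + abs(V12) ** 2 / (E1 - E2),
            E2 + V22 + abs(V12) ** 2 / (E2 - E1))

# Illustrative values with |E1 - E2| much larger than |V12| and |V11 - V22|.
E1, E2, V11, V22, V12 = 1.0, 0.0, 0.01, -0.02, 0.05
exact = exact_levels(E1 + V11, E2 + V22, V12)
approx = second_order_levels(E1, E2, V11, V22, V12)
print(exact)
print(approx)

# Cross-check of (8) against a direct matrix diagonalization.
H = np.array([[E1 + V11, V12], [V12, E2 + V22]])
direct = np.linalg.eigvalsh(H)[::-1]
```

Since $E_1^{(0)} > E_2^{(0)}$ in this choice, the upper root of (8) belongs to level 1. Moving $E_1^{(0)}$ towards $E_2^{(0)}$ makes the two results drift apart, which is the breakdown discussed in part a).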
c) Let us abbreviate

$$\Delta = H_{11} - H_{22} \,;$$

then

$$E_{1,2} = H_{11} - \tfrac{1}{2}\Delta \pm \tfrac{1}{2}\sqrt{\Delta^2 + 4|V_{12}|^2} \,. \tag{11}$$

The shortest distance between the energy eigenvalues is $2|V_{12}|$, i.e. the level crossing is prevented by the interaction. This is related to the so-called Landau-Zener effect, which we shall not discuss here. It should only be mentioned that the Landau-Zener effect deals with the question of the transfer of a particle (e.g. an electron) occupying one of the two crossing levels. As we might intuitively imagine, the particle transfer to the other crossing state depends on how close the two levels come to each other and also on the velocity with which the level crossing is passed. Inserting the energies $E_{1,2}$ of (8) into (6), the new states can be easily calculated (up to normalization):

$$\begin{aligned}
\psi_1 &= a_1\psi_1^{(0)} + b_1\psi_2^{(0)} = V_{12}\,\psi_1^{(0)} - \left(\tfrac{1}{2}\Delta - \tfrac{1}{2}\sqrt{\Delta^2 + 4|V_{12}|^2}\right)\psi_2^{(0)} \,, \\
\psi_2 &= a_2\psi_1^{(0)} + b_2\psi_2^{(0)} = V_{12}\,\psi_1^{(0)} - \left(\tfrac{1}{2}\Delta + \tfrac{1}{2}\sqrt{\Delta^2 + 4|V_{12}|^2}\right)\psi_2^{(0)} \,,
\end{aligned} \tag{12}$$

with the interesting result that the wave functions are intermixed strongly at the level crossing, but are practically unperturbed far away from the crossing point (large $|\Delta|$). Moreover, $\psi_1$ approaches $\psi_2^{(0)}$ far to the left of the crossing point (large negative $\Delta$) and it approaches $\psi_1^{(0)}$ far to the right of the crossing point (large positive $\Delta$). For $\psi_2$ the situation is reversed. This fact can be expressed as follows: the unperturbed wave functions do indeed cross and remain unchanged except in the immediate vicinity of the crossing point. The energies $E_1$ and $E_2$ also change over from one unperturbed value to the other: according to (11), $E_1$ tends to $H_{22}$ far to the left and to $H_{11}$ far to the right of the crossing point, and for $E_2$ the situation is reversed. The situation is illustrated in the figure below.
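The avoided crossing described by (11) and the exchange of character described by (12) can be tabulated directly. The following sketch assumes a real coupling $V_{12} = 0.1$ and $H_{11} = 0$ purely for illustration; `mixing` is a hypothetical helper name.

```python
import numpy as np

V12 = 0.1                                   # real coupling (assumed value)
H11 = 0.0                                   # reference energy (assumed value)
deltas = np.linspace(-5.0, 5.0, 1001)       # Delta = H11 - H22

root = np.sqrt(deltas ** 2 + 4 * V12 ** 2)
E_upper = H11 - 0.5 * deltas + 0.5 * root   # E_1 of (11)
E_lower = H11 - 0.5 * deltas - 0.5 * root   # E_2 of (11)
min_gap = (E_upper - E_lower).min()         # smallest level distance

def mixing(delta):
    """Weight of psi_2^(0) in the normalized state psi_1 of (12)."""
    a = V12
    b = -(0.5 * delta - 0.5 * np.sqrt(delta ** 2 + 4 * V12 ** 2))
    return b ** 2 / (a ** 2 + b ** 2)

print(min_gap, 2 * V12)
print(mixing(-5.0), mixing(0.0), mixing(5.0))
```

The smallest gap occurs at $\Delta = 0$, where both unperturbed states enter with equal weight; far from the crossing the weight tends to 1 on the left and to 0 on the right.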


[Figure caption, Exercise 11.3: Level crossing. Energy levels $E_1$, $E_2$ as a function of the energy difference $\Delta$ of the unperturbed system. The values of $H_{11}$ and $H_{22}$ are indicated by dashed lines.]

[Figure: electronic energy levels (in keV) of the Pb + Pb system versus the two-centre distance $R$ (in fm, axis from 400 to 1000); legend: $\sigma$ states and $\pi$ states (dashed). Caption: Example of level crossings in an electronic superheavy quasimolecule, as it is found in a heavy-ion collision of lead on lead. These crossings play an important role in the ionization process occurring during such atom-atom encounters.]

As an example of level crossing, we look at the electronic energy levels in a two-centred potential which is caused by two lead nuclei with a distance $R$ between the two centres. This potential is no longer radially symmetric; it merely still has azimuthal symmetry, i.e. $j_z$ is still a good quantum number. Furthermore, the Hamiltonian commutes with the parity operator. The solutions, shown below, were obtained not with the Schrödinger equation, but with the relativistic Dirac equation, but this is of no importance here. If we start from an $\hat{H}_0$ which belongs to a two-centre distance $R$ and pass over to $R + \Delta R$, the potential changes by

$$\Delta V = V(R + \Delta R; \boldsymbol{r}) - V(R; \boldsymbol{r}) \,, \qquad V(R; \boldsymbol{r}) = -\frac{Ze^2}{\left|\boldsymbol{r} - \frac{R}{2}\boldsymbol{e}_z\right|} - \frac{Ze^2}{\left|\boldsymbol{r} + \frac{R}{2}\boldsymbol{e}_z\right|} \,. \tag{13}$$

This perturbation does not change the azimuthal and parity symmetries. The matrix elements $V_{ij}$ vanish if the states have different parity or different magnetic quantum numbers; otherwise, they are in general different from zero.

Now we can understand the figure shown in the margin.$^2$ (We shall not explain all symbols exactly.) As we can see, the $3s\sigma$ state crosses the $3d_{5/2}\pi$ state at $R \approx 650$ fm. This is possible because $V_{ij} = 0$ (different $m$ quantum numbers, which are called here $\sigma$ ($m = 0$) or $\pi$ ($m = 1$)). On the other hand, the $3s\sigma$ and the $3d_{5/2}\sigma$ states repel each other. These states have the same $m$ quantum number and the same parity. The $3p_{3/2}\sigma$ and the $3d_{3/2}\sigma$ states are able to cross each other, because they differ in parity ($\Rightarrow V_{ij} = 0$) while agreeing in the $m$ quantum number.


EXERCISE


11.4 Harmonic Perturbation of a Harmonic Oscillator 


Consider the harmonic oscillator Hamiltonian

$$\hat{H} = \hat{H}_0 + W \quad\text{with}\quad \hat{H}_0 = -\frac{\hbar^2}{2m}\frac{\partial^2}{\partial x^2} + \frac{C_0}{2}x^2 \,,$$

and let $W = C_1x^2/2$ be an oscillator perturbation potential.


Problem. Calculate, using perturbation theory, the energy eigenvalues and 
compare them with the exact result. 


Solution. In this case, the stationary solutions of the Schrödinger equation with the Hamiltonians $\hat{H}_0$ and $\hat{H}$ are known, namely those of the harmonic oscillator.


$^2$ Figure appeared in G. Soff, W. Greiner, W. Betz, B. Müller: Phys. Rev. A 20, 169 (1979).


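The exact eigenvalues of $\hat{H}$ are

$$E_n = \hbar\omega\left(n + \tfrac{1}{2}\right) \,, \qquad \omega = \sqrt{\frac{C_0 + C_1}{m}} \,, \tag{1}$$

and those of $\hat{H}_0$ are

$$E_n^0 = \hbar\omega_0\left(n + \tfrac{1}{2}\right) \,, \qquad \omega_0 = \sqrt{\frac{C_0}{m}} \,. \tag{2}$$

Nevertheless, we want to calculate the eigenvalues of $\hat{H}$ approximately by perturbation theory to test its effectiveness. It was shown in (11.20) that in second-order perturbation

$$E_l = E_l^0 + W_{ll} + \sum_{n \neq l} \frac{W_{ln}W_{nl}}{E_l^0 - E_n^0} \tag{3}$$

with

$$W_{ln} = \langle\psi_l|W|\psi_n\rangle = \frac{C_1}{2}\langle\psi_l|x^2|\psi_n\rangle \,. \tag{4}$$

To calculate $W_{ln}$, we need some of the already-determined relations [see (7.78)], and again introduce the coordinate

$$\xi = \sqrt{\frac{m\omega_0}{\hbar}}\,x \,.$$

The basis functions are [see (7.52)]

$$\psi_n = \frac{(m\omega_0/\hbar)^{1/4}}{(\pi^{1/2}\,2^n\,n!)^{1/2}}\,e^{-\xi^2/2}\,H_n(\xi) \,, \tag{5}$$

where $H_n(\xi)$ are the Hermite polynomials. The $\psi_n$ are the eigenfunctions of $\hat{H}_0$. Then

$$\xi\psi_n = \sqrt{\frac{n}{2}}\,\psi_{n-1} + \sqrt{\frac{n+1}{2}}\,\psi_{n+1} \,. \tag{6}$$

We need, however,

$$\begin{aligned}
\xi^2\psi_n &= \sqrt{\frac{n}{2}}\,\xi\psi_{n-1} + \sqrt{\frac{n+1}{2}}\,\xi\psi_{n+1} \\
&= \tfrac{1}{2}\sqrt{n(n-1)}\,\psi_{n-2} + \left(n + \tfrac{1}{2}\right)\psi_n + \tfrac{1}{2}\sqrt{(n+1)(n+2)}\,\psi_{n+2} \,.
\end{aligned} \tag{7}$$

With this we can calculate

$$\langle\psi_l|x^2|\psi_n\rangle = \frac{\hbar}{m\omega_0}\left[\tfrac{1}{2}\sqrt{n(n-1)}\,\delta_{l,n-2} + \left(n + \tfrac{1}{2}\right)\delta_{l,n} + \tfrac{1}{2}\sqrt{(n+1)(n+2)}\,\delta_{l,n+2}\right] \,. \tag{8}$$

So we get for the energy of the ground state ($l = 0$)

$$E_0 = E_0^0 + W_{00} + \sum_{n \neq 0} \frac{W_{0n}W_{n0}}{E_0^0 - E_n^0} \,. \tag{9}$$

As all $W_{0n} = 0$ ($n \neq 0$) except for $n = 2$, it follows that

$$\begin{aligned}
E_0 &= \frac{1}{2}\hbar\omega_0 + \langle\psi_0|(C_1/2)x^2|\psi_0\rangle + \frac{|\langle\psi_0|(C_1/2)x^2|\psi_2\rangle|^2}{E_0^0 - E_2^0} \\
&= \frac{1}{2}\hbar\omega_0 + \frac{C_1}{2}\,\frac{\hbar}{m\omega_0}\,\frac{1}{2} + \left(\frac{C_1}{2}\,\frac{\hbar}{m\omega_0}\,\frac{\sqrt{2}}{2}\right)^2 \frac{1}{\frac{1}{2}\hbar\omega_0 - \frac{5}{2}\hbar\omega_0} \\
&= \frac{1}{2}\hbar\omega_0\left[1 + \frac{1}{2}\,\frac{C_1}{C_0} - \frac{1}{8}\left(\frac{C_1}{C_0}\right)^2\right] \,,
\end{aligned} \tag{10}$$

because $C_0 = m\omega_0^2$.

In fact it is possible to take into consideration all orders of perturbation theory. The result will be

$$\begin{aligned}
E_0 &= \frac{1}{2}\hbar\omega_0\left[1 + \frac{1}{2}\,\frac{C_1}{C_0} - \frac{1}{2 \cdot 4}\left(\frac{C_1}{C_0}\right)^2 + \frac{1 \cdot 3}{2 \cdot 4 \cdot 6}\left(\frac{C_1}{C_0}\right)^3 - \frac{1 \cdot 3 \cdot 5}{2 \cdot 4 \cdot 6 \cdot 8}\left(\frac{C_1}{C_0}\right)^4 + \cdots\right] \\
&= \frac{1}{2}\hbar\omega_0\sqrt{1 + \frac{C_1}{C_0}} = \frac{1}{2}\hbar\sqrt{\frac{C_0}{m}}\sqrt{\frac{C_0 + C_1}{C_0}} = \frac{1}{2}\hbar\sqrt{\frac{C_0 + C_1}{m}} = \frac{1}{2}\hbar\omega \,.
\end{aligned} \tag{11}$$

With this we have found the exact result in perturbation theory of infinite order. It is also clear that second-order perturbation theory (10) yields a correction for the unperturbed oscillator towards the exact result for the modified oscillator (11).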
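The comparison between (10) and (11) is easy to reproduce numerically from the matrix elements (8). The sketch below works in units of $\hbar\omega_0$, where $W = \tfrac{1}{2}(C_1/C_0)\,\xi^2$; the function names, the basis size `nmax` and the sample ratio are illustrative assumptions.

```python
import numpy as np

def x2_matrix(nmax):
    """Matrix <l| xi^2 |n> in the oscillator basis, built from (7) and (8)."""
    M = np.zeros((nmax, nmax))
    for n in range(nmax):
        M[n, n] = n + 0.5
        if n + 2 < nmax:
            M[n, n + 2] = M[n + 2, n] = 0.5 * np.sqrt((n + 1) * (n + 2))
    return M

def ground_state_energies(ratio, nmax=60):
    """Return (second-order PT, exact) ground-state energy in units of
    hbar*omega_0 for W = C1 x^2 / 2, where ratio = C1 / C0."""
    W = 0.5 * ratio * x2_matrix(nmax)     # perturbation in units of hbar*omega_0
    E0 = np.arange(nmax) + 0.5            # unperturbed levels
    second = sum(W[0, n] ** 2 / (E0[0] - E0[n]) for n in range(1, nmax))
    return E0[0] + W[0, 0] + second, 0.5 * np.sqrt(1.0 + ratio)

pt, exact = ground_state_energies(0.1)
print(pt, exact)
```

For $C_1/C_0 = 0.1$ the two numbers differ only in the fifth decimal place, which is the size of the neglected third-order term of the series (11).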


EXERCISE 


11.5 Harmonic Oscillator with Linear Perturbation 


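Consider the harmonic-oscillator Hamiltonian

$$\hat{H} = \hat{H}_0 + W \quad\text{with}\quad \hat{H}_0 = -\frac{\hbar^2}{2m}\frac{\partial^2}{\partial x^2} + \frac{C_0}{2}x^2 \,,$$

with $W = C_0ax$ as the perturbation potential.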


Problem. Calculate, using perturbation theory, the energy eigenvalues and 
compare them with the exact result. 


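Solution. In this case, the exact stationary solutions are known, too:

$$\left[-\frac{\hbar^2}{2m}\frac{\partial^2}{\partial x^2} + \frac{C_0}{2}(x^2 + 2ax)\right]\psi = E\psi \,. \tag{1}$$

We define $y = x + a$ and transform to

$$\left[-\frac{\hbar^2}{2m}\frac{\partial^2}{\partial y^2} + \frac{C_0}{2}(y^2 - a^2)\right]\psi = E\psi \tag{2}$$

or

$$\left[-\frac{\hbar^2}{2m}\frac{\partial^2}{\partial y^2} + \frac{C_0}{2}y^2\right]\psi = E'\psi \,, \qquad E' = E + \frac{C_0}{2}a^2 \,. \tag{3}$$

Equation (3) is the ordinary oscillator equation with the known eigenvalues

$$E'_n = \hbar\omega_0\left(n + \tfrac{1}{2}\right) \,, \qquad \omega_0 = \sqrt{\frac{C_0}{m}} \,, \tag{4}$$

or

$$E_n = \hbar\omega_0\left(n + \tfrac{1}{2}\right) - \frac{C_0}{2}a^2 \,. \tag{5}$$

[Figure caption, Example 11.5: The superposition of an oscillator potential with a linear potential yields a shift of the original oscillator; otherwise everything remains the same. The dashed lines refer to the shifted coordinate system.]

Now we try to find this result by perturbation theory. In order to do so, we again need the matrix elements

$$W_{ln} = \langle\psi_l|C_0ax|\psi_n\rangle = C_0a\sqrt{\frac{\hbar}{m\omega_0}}\,\langle\psi_l|\xi|\psi_n\rangle = C_0a\sqrt{\frac{\hbar}{m\omega_0}}\left[\sqrt{\frac{n}{2}}\,\delta_{l,n-1} + \sqrt{\frac{n+1}{2}}\,\delta_{l,n+1}\right] \,. \tag{6}$$

For the energy of the ground state ($l = 0$), we get, using

$$W_{0n} = C_0a\sqrt{\frac{\hbar}{m\omega_0}}\,\langle\psi_0|\xi|\psi_n\rangle = C_0a\sqrt{\frac{\hbar}{m\omega_0}}\left[\sqrt{\frac{n}{2}}\,\delta_{0,n-1} + \sqrt{\frac{n+1}{2}}\,\delta_{0,n+1}\right] \,, \tag{7}$$

the equations:

$$W_{00} = 0 \,, \qquad W_{01} = C_0a\sqrt{\frac{\hbar}{m\omega_0}}\,\frac{1}{\sqrt{2}} \,, \qquad W_{0n} = 0 \quad\text{for}\quad n \geq 2 \,. \tag{8}$$

This yields

$$E_0 = \frac{1}{2}\hbar\omega_0 + 0 + \frac{C_0^2a^2\,\dfrac{\hbar}{2m\omega_0}}{\frac{1}{2}\hbar\omega_0 - \frac{3}{2}\hbar\omega_0} = \frac{1}{2}\hbar\omega_0 - \frac{C_0^2a^2}{2m\omega_0^2} = \frac{1}{2}\hbar\omega_0 - \frac{C_0}{2}a^2 \,. \tag{9}$$

This agrees with the exact result above. Therefore we can conclude that for the ground state the higher orders of perturbation theory have to vanish identically.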
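That second order already reproduces the exact shift can also be checked by diagonalizing $\hat{H}_0 + W$ in a truncated oscillator basis. This is a sketch under stated assumptions: energies are in units of $\hbar\omega_0$, and the dimensionless parameter `strength` $= C_0a\sqrt{\hbar/m\omega_0}/(\hbar\omega_0)$ is a name introduced here, so that $C_0a^2/2$ becomes `strength**2 / 2`.

```python
import numpy as np

def shifted_ground_state(strength, nmax=80):
    """Ground-state energy of H0 + C0*a*x in units of hbar*omega_0.

    Returns (numerical diagonalization, second-order perturbation theory).
    """
    n = np.arange(nmax)
    xi = np.zeros((nmax, nmax))
    off = np.sqrt((n[:-1] + 1) / 2.0)      # <n| xi |n+1> = sqrt((n+1)/2)
    xi[n[:-1], n[:-1] + 1] = off
    xi[n[:-1] + 1, n[:-1]] = off
    H = np.diag(n + 0.5) + strength * xi   # truncated matrix of H0 + W
    numerical = np.linalg.eigvalsh(H)[0]
    # Only W_01 contributes in second order, see (8) and (9).
    second_order = 0.5 + (strength / np.sqrt(2.0)) ** 2 / (0.5 - 1.5)
    return numerical, second_order

numerical, second_order = shifted_ground_state(0.3)
print(numerical, second_order, 0.5 - 0.3 ** 2 / 2)
```

All three printed numbers agree to within the truncation error of the basis, in line with the statement that the higher orders vanish for the ground state.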


11.3 The Ritz Variational Method 


It is possible to determine the ground state without the explicit solution of the Schrödinger equation by requiring that its energy be the lowest of all possible energies for all possible wave functions. To understand this, we consider an arbitrary Hamiltonian $\hat{H}$ and demand that its spectrum have a lower limit. This means that it has a lowest, nondegenerate energy eigenvalue:

$$\hat{H}\psi_n = E_n\psi_n \,, \qquad n = 0, 1, 2, \dots \,, \tag{11.28}$$

$$E_n > E_0 \,, \qquad n \neq 0 \,. \tag{11.29}$$

We can expand any arbitrary normalized wave function $\psi$ into eigenfunctions of $\hat{H}$ and get

$$\psi = \sum_n a_n\psi_n \,, \qquad \sum_n |a_n|^2 = 1 \,. \tag{11.30}$$

Then the mean energy of $\psi$ is

$$\langle\psi|\hat{H}|\psi\rangle = \sum_{n,m} a_n^*a_m\langle\psi_n|\hat{H}|\psi_m\rangle = \sum_{n,m} a_n^*a_mE_n\delta_{nm} = \sum_n E_n|a_n|^2 \geq E_0\sum_n |a_n|^2 = E_0 \,. \tag{11.31}$$

This obviously means that every other state $\psi$, which differs from the true ground state $\psi_0$, has a higher energy than $\psi_0$. This result can also be written in the form

$$E_0 = \min_{\psi \in \mathcal{H}}\left[\frac{\langle\psi|\hat{H}|\psi\rangle}{\langle\psi|\psi\rangle}\right] \,. \tag{11.32}$$

Here, $\psi \in \mathcal{H}$ indicates that $\psi$ is an element of the Hilbert space $\mathcal{H}$. $\psi$ need not even be normalized in this equation. Finding the energy of the ground state has therefore become a variational problem. That the expression in brackets in (11.32) is stationary we know from variational calculus to be a necessary condition for the extremum (minimum):

$$\delta(E_\psi) = \delta\,\frac{\langle\psi|\hat{H}|\psi\rangle}{\langle\psi|\psi\rangle} = 0 \,. \tag{11.33}$$

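With the well-known rule for differentiating the ratio of two functions, it follows that

$$\frac{\left(\delta\langle\psi|\hat{H}|\psi\rangle\right)\langle\psi|\psi\rangle - \langle\psi|\hat{H}|\psi\rangle\left(\delta\langle\psi|\psi\rangle\right)}{\langle\psi|\psi\rangle^2} = 0 \,, \tag{11.34}$$

where it is sufficient that the numerator vanishes. As $\psi$ is a complex function, we can look at $\psi$ and $\psi^*$ as two independent functions. As in the case of Hamilton's principle in mechanics, we find

$$\frac{\delta}{\delta\psi^*(x)}\langle\psi|\hat{H}|\psi\rangle = \frac{\delta}{\delta\psi^*(x)}\int \psi^*(x')\,\hat{H}\psi(x')\,\mathrm{d}x' = \hat{H}\psi(x) \,, \tag{11.35a}$$

$$\frac{\delta}{\delta\psi^*(x)}\langle\psi|\psi\rangle = \frac{\delta}{\delta\psi^*(x)}\int \psi^*(x')\,\psi(x')\,\mathrm{d}x' = \psi(x) \,. \tag{11.35b}$$

We then get with (11.34) an eigenvalue equation for $\psi$:

$$\langle\psi|\psi\rangle\,\hat{H}\psi(x) - \langle\psi|\hat{H}|\psi\rangle\,\psi(x) = 0 \quad\text{or}\quad \hat{H}\psi(x) = \frac{\langle\psi|\hat{H}|\psi\rangle}{\langle\psi|\psi\rangle}\,\psi(x) = E_\psi\,\psi(x) \,, \tag{11.36}$$

which is exactly the Schrödinger equation.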

The variational principle (11.33), known as Ritz's variational method, is therefore equivalent to the Schrödinger formalism of quantum mechanics (for stationary states). Under the additional condition that $E_\psi$ be the absolute minimum, we then get (11.32) and thus the ground state energy.

Ritz's method is used for many practical purposes; we proceed in the following way. The (test) wave function $\psi(x, \alpha_1, \dots, \alpha_n)$ is made to depend on the real parameters $\alpha_i$; we then search according to (11.33) for the minimum of $E_\psi(\alpha_1, \dots, \alpha_n)$:

$$\frac{\partial}{\partial\alpha_i}E_\psi(\alpha_1, \dots, \alpha_n) = 0 \,, \qquad i = 1, \dots, n \,. \tag{11.37}$$

In this way, we get an upper bound for the energy of the ground state.

It is also possible to use Ritz's variational method for the lowest energy states of a special kind; for example, for the lowest state with angular momentum $l = 0$, $l = 1$, $l = 2$ etc. We then find the lowest energy states for $l = 0$ or $l = 1$, or for $l = 2$, etc. The test wave functions have to be of the same special kind, i.e. they have to be angular-momentum functions for $l = 0$, $l = 1$, or $l = 2$, etc.

It is also possible to determine the second lowest state of the same kind of states if we demand that it be orthogonal to the lowest state. A great number of extensions are possible.
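The bound (11.31) and (11.32) can be illustrated with a finite-dimensional stand-in for the Hamiltonian. In the sketch below a random real symmetric matrix plays the role of $\hat{H}$ and random vectors play the role of trial functions; both are assumptions made only for this demonstration.

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.normal(size=(6, 6))
H = 0.5 * (A + A.T)                   # arbitrary real symmetric "Hamiltonian"
E0 = np.linalg.eigvalsh(H)[0]         # true lowest eigenvalue

def rayleigh_quotient(psi):
    """<psi|H|psi> / <psi|psi> for a trial vector that need not be normalized."""
    return (psi @ H @ psi) / (psi @ psi)

trial_energies = [rayleigh_quotient(rng.normal(size=6)) for _ in range(1000)]
lowest_trial = min(trial_energies)
print(E0, lowest_trial)
```

No trial vector gives an energy below `E0`; equality would require the trial vector to be the ground state itself.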


EXAMPLE


11.6 Application of the Ritz Variational Method: 
The Harmonic Oscillator 


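We are searching for the ground state of a particle in the oscillator potential

$$\hat{H} = -\frac{\hbar^2}{2m}\frac{\mathrm{d}^2}{\mathrm{d}x^2} + \frac{m\omega^2}{2}x^2 \,. \tag{1}$$

As test wave function we use

$$\varphi = A\exp\left(-\frac{\lambda}{2}x^2\right) \tag{2}$$

with $A$ and $\lambda$ as free parameters. $A$ describes, of course, the normalization of the wave function and is therefore trivial. The interesting parameter is $\lambda$. We get

$$\begin{aligned}
\hat{H}|\varphi\rangle &= \left(-\frac{\hbar^2}{2m}\frac{\mathrm{d}^2}{\mathrm{d}x^2} + \frac{m\omega^2}{2}x^2\right)A\exp\left(-\frac{\lambda}{2}x^2\right) \\
&= A\left[\frac{\hbar^2}{2m}\left(\lambda - \lambda^2x^2\right) + \frac{m\omega^2}{2}x^2\right]\exp\left(-\frac{\lambda}{2}x^2\right)
\end{aligned} \tag{3}$$

and consequently

$$\begin{aligned}
\langle\varphi|\hat{H}|\varphi\rangle &= A^2\int_{-\infty}^{\infty}\left[\frac{\hbar^2}{2m}\left(\lambda - \lambda^2x^2\right) + \frac{m\omega^2}{2}x^2\right]e^{-\lambda x^2}\,\mathrm{d}x \\
&= A^2\sqrt{\frac{\pi}{\lambda}}\,\frac{1}{2\lambda}\left(\frac{\hbar^2\lambda^2}{2m} + \frac{m\omega^2}{2}\right)
\end{aligned} \tag{4}$$

and

$$\langle\varphi|\varphi\rangle = A^2\sqrt{\frac{\pi}{\lambda}} \,. \tag{5}$$

The energy as a function of $\lambda$ is given by

$$E(\lambda) = \frac{\langle\varphi|\hat{H}|\varphi\rangle}{\langle\varphi|\varphi\rangle} = \frac{1}{2\lambda}\left(\frac{\hbar^2\lambda^2}{2m} + \frac{m\omega^2}{2}\right) \,. \tag{6}$$

Hence, the Ritz procedure leads to

$$\frac{\partial E}{\partial\lambda} = \frac{\hbar^2}{4m} - \frac{m\omega^2}{4\lambda^2} = 0 \,, \tag{7}$$

and thus

$$\lambda_0^2 = \frac{m^2\omega^2}{\hbar^2} \quad\Rightarrow\quad \lambda_0 = \frac{m\omega}{\hbar} \,. \tag{8}$$

Therefore the ground state energy is

$$E(\lambda_0) = \frac{\hbar^2\lambda_0}{4m} + \frac{m\omega^2}{4\lambda_0} = \frac{1}{4}\hbar\omega + \frac{1}{4}\hbar\omega = \frac{1}{2}\hbar\omega \,. \tag{9}$$

We see that in this special case, by using the variational method, we get exactly the ground state (see Chap. 7). The ground state wave function is then determined by inserting $\lambda_0$ from (8) into (2).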
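The minimization (7) to (9) can be mirrored numerically by scanning the parameter $\lambda$. The sketch below assumes units with $\hbar = m = \omega = 1$ and a simple grid search; both are illustrative choices rather than part of the method.

```python
import numpy as np

hbar = m = omega = 1.0    # units chosen for illustration

def energy(lam):
    """E(lambda) from (6) for the trial function exp(-lambda x^2 / 2)."""
    return (hbar ** 2 * lam ** 2 / (2 * m) + m * omega ** 2 / 2) / (2 * lam)

lams = np.linspace(0.2, 3.0, 2801)          # grid of trial parameters
best = lams[np.argmin(energy(lams))]        # grid point with the lowest energy
print(best, energy(best))
```

The grid minimum lies at $\lambda \approx m\omega/\hbar$ with $E \approx \hbar\omega/2$; every other grid point gives a larger energy, as the upper-bound property requires.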


11.4 Time-Dependent Perturbation Theory 


One of the main tasks of quantum mechanics is the calculation of transition probabilities from one state $\psi_n$ to another state $\psi_m$. This occurs under the influence of a time-dependent perturbation $V(\boldsymbol{r}, t)$, which, so to say, "shakes" the system and so causes the transition. The question of the transition of a system from one state to another generally only makes sense if the cause of the transition, i.e. $V(\boldsymbol{r}, t)$, acts only within a finite time period, say from $t = 0$ to $t = T$. Except for this time period, the total energy is a constant of motion, which can be measured.

The change of the wave function while $V(\boldsymbol{r}, t)$ is acting is given by a Schrödinger equation. The solution of this equation, however, generally leads to great difficulties. General predictions can only be made if the transition is caused by weak influences, i.e. weak potentials $V(\boldsymbol{r}, t)$. These influences can be interpreted as perturbations.

If perturbations are already taken into account in the Schrödinger equation, it takes the following form:

$$i\hbar\frac{\partial\psi}{\partial t} = \hat{H}_0(\boldsymbol{r})\psi + V(\boldsymbol{r}, t)\psi \,. \tag{11.38}$$

Here, $\hat{H}_0(\boldsymbol{r})$ is the operator for the total energy of the system without perturbation; the index 0 stands for the time independence. $V(\boldsymbol{r}, t)$ is the perturbation (perturbation potential).

[Fig. 11.2. General form of a perturbation $V(\boldsymbol{r}, t)$ in the time period $0 \leq t \leq T$. Such a perturbation can be caused, for example, by an external field, which is switched on during this period, or by a particle that is passing by. In the latter case, $T$ is a measure of the collision (interaction) time.]

For the calculation of the transition probability $W_{mn}(t)$ from the energy level $E_n$ to the energy level $E_m$ of the unperturbed system [described by $\hat{H}_0(\boldsymbol{r})$] it is advisable to use the $E$ representation (energy representation). But first we look for eigenvalues of the unperturbed problem, i.e.

$$i\hbar\frac{\partial\psi}{\partial t} = \hat{H}_0\psi \,. \tag{11.39}$$


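If the stationary part of the normalized wave function satisfies the equation

$$\hat{H}_0(\boldsymbol{r})\,\psi_k(\boldsymbol{r}) = E_k\,\psi_k(\boldsymbol{r}) \,, \tag{11.40}$$

then the time-dependent functions

$$\psi_k(\boldsymbol{r}, t) = \psi_k(\boldsymbol{r})\exp\left(-\frac{i}{\hbar}E_kt\right) \tag{11.41}$$

are the solutions of the unperturbed system. They form a complete set of functions and the solution of the main problem (11.38) can be expanded in terms of these functions, i.e.

$$\psi(\boldsymbol{r}, t) = \sum_k a_k(t)\,\psi_k(\boldsymbol{r})\exp\left(-\frac{i}{\hbar}E_kt\right) = \sum_k a_k(t)\,\psi_k(\boldsymbol{r}, t) \,. \tag{11.42}$$

Inserting this into the Schrödinger equation (11.38) leads to

$$i\hbar\sum_k \frac{\mathrm{d}a_k}{\mathrm{d}t}\,\psi_k(\boldsymbol{r}, t) + i\hbar\sum_k a_k\frac{\partial\psi_k(\boldsymbol{r}, t)}{\partial t} = \sum_k a_k\hat{H}_0\,\psi_k(\boldsymbol{r}, t) + \sum_k a_kV\,\psi_k(\boldsymbol{r}, t) \tag{11.43}$$

or, because $i\hbar\,\partial\psi_k/\partial t = \hat{H}_0\psi_k$,

$$i\hbar\sum_k \frac{\mathrm{d}a_k}{\mathrm{d}t}\,\psi_k(\boldsymbol{r}, t) = \sum_k a_k(t)\,V(\boldsymbol{r}, t)\,\psi_k(\boldsymbol{r}, t) \,. \tag{11.44}$$

After multiplication by $\psi_m^*(\boldsymbol{r}, t)$ this becomes

$$i\hbar\sum_k \frac{\mathrm{d}a_k}{\mathrm{d}t}\,\psi_m^*\psi_k\exp\left[\frac{i}{\hbar}(E_m - E_k)t\right] = \sum_k a_k(t)\,\psi_m^*V\psi_k\exp\left[\frac{i}{\hbar}(E_m - E_k)t\right] \,. \tag{11.45}$$

Considering the normalization of the wave functions $\psi_k$ and the abbreviations

$$V_{mn}(t) = \int \mathrm{d}^3x\,\psi_m^*V\psi_n \quad\text{and}\quad \omega_{mn} = \frac{E_m - E_n}{\hbar} \,, \tag{11.46}$$

after integration over $\mathrm{d}V$, (11.45) leads to

$$i\hbar\sum_k \frac{\mathrm{d}a_k}{\mathrm{d}t}\,\delta_{mk}\,e^{i\omega_{mk}t} = \sum_k a_k(t)\,V_{mk}(t)\,e^{i\omega_{mk}t} \,. \tag{11.47}$$

With $\omega_{mm} = 0$, we finally get

$$i\hbar\frac{\mathrm{d}a_m}{\mathrm{d}t} = \sum_k a_k\,V_{mk}(t)\,e^{i\omega_{mk}t} \,. \tag{11.48}$$

The frequencies $\omega_{mn}$ are sometimes called Bohr frequencies for the transition between $E_m$ and $E_n$.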


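We assume that at the beginning (i.e. before the perturbation sets in), the system is in the state $E_n$. So we have for $t = 0$

$$\psi(\boldsymbol{r}, 0) = \sum_k a_k(0)\,\psi_k(\boldsymbol{r}, 0) = \psi_n(\boldsymbol{r}, 0) = \psi_n(\boldsymbol{r}) \,. \tag{11.49}$$

This just means that

$$a_n(0) = 1 \quad\text{and}\quad a_k(0) = 0 \quad\text{for}\quad k \neq n \,, \tag{11.50}$$

and already suggests the interpretation of the $a_k(t)$.

To understand this even better, let us first look at the normalization of $\psi(\boldsymbol{r}, t)$. We find

$$\begin{aligned}
1 &= \int \mathrm{d}^3x\,\psi^*(\boldsymbol{r}, t)\,\psi(\boldsymbol{r}, t) \\
&= \sum_{k,k'} a_k^*(t)\,a_{k'}(t)\exp\left[\frac{i}{\hbar}(E_k - E_{k'})t\right]\int \mathrm{d}^3x\,\psi_k^*(\boldsymbol{r})\,\psi_{k'}(\boldsymbol{r}) \\
&= \sum_{k,k'} a_k^*(t)\,a_{k'}(t)\,e^{i\omega_{kk'}t}\,\delta_{kk'} = \sum_k |a_k(t)|^2 \,.
\end{aligned} \tag{11.51}$$

The expansion coefficients $a_k(t)$ must obviously satisfy the normalization condition for all times $t$, especially in the interval of the perturbation ($0 < t < T$).

Now we want to discuss the meaning of the $a_k(t)$. At time $t$ we can write the wave function $\psi(\boldsymbol{r}, t)$ as

$$\psi(\boldsymbol{r}, t) = \sum_k a_k(t)\,\psi_k(\boldsymbol{r})\exp\left(-\frac{i}{\hbar}E_kt\right) \,. \tag{11.52}$$

The matrix element

$$\langle\psi_m(\boldsymbol{r})|\psi(\boldsymbol{r}, t)\rangle = \left\langle\psi_m(\boldsymbol{r})\,\Big|\,\sum_k a_k(t)\,\psi_k(\boldsymbol{r})\exp\left(-\frac{i}{\hbar}E_kt\right)\right\rangle = a_m(t)\exp\left(-\frac{i}{\hbar}E_mt\right) \tag{11.53}$$

describes the overlap between the time-dependent wave function $\psi(\boldsymbol{r}, t)$ and the stationary wave function $\psi_m(\boldsymbol{r})$. The probability of finding the state $\psi_m(\boldsymbol{r})$ in $\psi(\boldsymbol{r}, t)$ at the time $t$ with the energy $E_m$ is given, as is well known, by the square of this term, i.e. by

$$|\langle\psi_m(\boldsymbol{r})|\psi(\boldsymbol{r}, t)\rangle|^2 = |a_m(t)|^2 \,. \tag{11.54}$$

Since, according to the initial conditions [see (11.50)], at $t = 0$, $a_m(t = 0) = \delta_{mn}$ holds and since, in general, $a_m(t) \neq 0$ (for all $m$) for $t > 0$, it is evident that the quantities $|a_m(t)|^2$ give the probability of finding the system at time $t$ in the state $\psi_m$ with the energy $E_m$. Taking the initial conditions into account, $|a_m(t)|^2$ is the probability for the transition from the state $\psi_n$ to $\psi_m$ in the period from $t = 0$ to $t$:

$$W_{mn}(t) = |a_m(t)|^2 \,. \tag{11.55}$$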


Now it is our task to calculate the amplitudes $a_m(t)$ from the coupled differential equations (11.48) and the initial conditions (11.50). So far, the problem is clearly and exactly formulated. The solution, however, can in general only be obtained approximately and successively. We consider the fact that $V(\boldsymbol{r}, t)$ represents a small perturbation; in the absence of perturbation, the system remains unchanged in its initial state. Thus in zeroth order we can make the following approximation, considering only small perturbations:

$$a_m^{(0)}(t) = \delta_{mn} \,, \tag{11.56}$$

which means that we start the zero-order solution with the initial conditions (11.50). This approximation is used to calculate the next best approximation, as we insert this solution into the right-hand side of the differential equations (11.48) (successive approximation):

$$i\hbar\frac{\mathrm{d}a_m^{(1)}}{\mathrm{d}t} = \sum_k \delta_{kn}\,V_{mk}(t)\,e^{i\omega_{mk}t} = V_{mn}(t)\,e^{i\omega_{mn}t} \,. \tag{11.57}$$

This procedure can be continued until we reach the precision desired or necessary. In general, the iteration procedure for the differential equations (11.48) can be formulated as

$$i\hbar\frac{\mathrm{d}a_m^{(s+1)}}{\mathrm{d}t} = \sum_k a_k^{(s)}\,V_{mk}(t)\,e^{i\omega_{mk}t} \,. \tag{11.58}$$

We restrict ourselves to first-order approximation and find, after integration, that

$$a_m^{(1)}(t) = \frac{1}{i\hbar}\int_0^t V_{mn}(t')\,e^{i\omega_{mn}t'}\,\mathrm{d}t' + \delta_{mn} \,. \tag{11.59}$$

Now the qualities of the perturbation, mentioned above, are used. Thus we assume that $V(\boldsymbol{r}, t) = 0$ for $t < 0$ and $t > T$. Further, we suppose that $V_{mn}(t)$ is so small that the first-order approximation holds even for $t = T$. Then we get for $t \geq T$

$$a_m^{(1)}(t) = \frac{1}{i\hbar}\int_0^T V_{mn}(t')\,e^{i\omega_{mn}t'}\,\mathrm{d}t' = \frac{1}{i\hbar}\int_{-\infty}^{\infty} V_{mn}(t')\,e^{i\omega_{mn}t'}\,\mathrm{d}t' \,, \qquad m \neq n \,. \tag{11.60}$$

This means that, in particular, $a_m^{(1)}(t)$ is constant in time for $t > T$. It becomes a constant of motion for $t > T$. The perturbation has ceased and the system has settled into a new state.
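To see how well the first-order amplitude (11.60) works for a weak perturbation, one can integrate the coupled equations (11.48) directly. The sketch below is restricted to two levels with a constant real coupling during $0 \leq t \leq T$; the values of `omega`, `v`, `T`, the step count and the hand-written Runge-Kutta integrator are all assumptions made for illustration.

```python
import numpy as np

hbar = 1.0
omega = 1.0     # Bohr frequency omega_mn (illustrative value)
v = 0.01        # constant matrix element V_mn = V_nm during 0 <= t <= T
T = 5.0         # duration of the perturbation

def rhs(t, a):
    """Right-hand side of (11.48) for two levels, a = (a_n, a_m)."""
    a_n, a_m = a
    return np.array([v * np.exp(-1j * omega * t) * a_m,
                     v * np.exp(+1j * omega * t) * a_n]) / (1j * hbar)

def integrate(steps=2000):
    """Classical fourth-order Runge-Kutta integration from t = 0 to t = T."""
    a = np.array([1.0 + 0j, 0.0 + 0j])     # initial condition (11.50)
    dt = T / steps
    for i in range(steps):
        t = i * dt
        k1 = rhs(t, a)
        k2 = rhs(t + dt / 2, a + dt / 2 * k1)
        k3 = rhs(t + dt / 2, a + dt / 2 * k2)
        k4 = rhs(t + dt, a + dt * k3)
        a = a + dt / 6 * (k1 + 2 * k2 + 2 * k3 + k4)
    return a

a_numeric = integrate()
# First-order amplitude (11.60) for a constant V_mn: the integral is elementary.
a_first = v * (np.exp(1j * omega * T) - 1) / (1j * omega) / (1j * hbar)
p_numeric = abs(a_numeric[1]) ** 2
p_first_order = abs(a_first) ** 2
print(p_numeric, p_first_order)
```

For this weak coupling the two transition probabilities agree to well below one percent; increasing `v` towards `omega` makes the first-order estimate deteriorate.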
Let us now study the meaning of a‘)(t) in greater detail. For that purpose we 
note that the perturbation can be expanded in a Fourier series: 


Vir i= | Vie des do (11.61) 


2 120. 


11.5 Time-Independent Perturbation 


According to the theorem of Fourier integrals, the Fourier component V(r, @) 
is then 
te 6} 


1 
Vir, y= f Vir, ede (11.62) 


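For the matrix element (11.46) we then find

$$\begin{aligned} V_{mn}(t) &= \int d^3x\, \psi_m^*(\mathbf r)\, V(\mathbf r, t)\, \psi_n(\mathbf r) \\ &= \int_{-\infty}^{\infty} d\omega\, e^{-i\omega t} \int d^3x\, \psi_m^*(\mathbf r)\, V(\mathbf r, \omega)\, \psi_n(\mathbf r) \\ &= \int_{-\infty}^{\infty} V_{mn}(\omega)\, e^{-i\omega t}\, d\omega , \end{aligned} \tag{11.63}$$

where $V_{mn}(\omega)$ is now the matrix element of the Fourier transform $V(\mathbf r, \omega)$ because, according to the Fourier theorem and in analogy to equation (11.62),

$$V_{mn}(\omega) = \frac{1}{2\pi}\int_{-\infty}^{\infty} V_{mn}(t)\, e^{i\omega t}\, dt . \tag{11.64}$$

Comparing this with the expression (11.60) for $a_m^{(1)}(t)$, we find the relation

$$a_m^{(1)}(t) = \frac{2\pi}{i\hbar}\, V_{mn}(\omega_{mn}) . \tag{11.65}$$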


Thus we obtain for the transition probability

$$W_{mn}(t) = \frac{4\pi^2}{\hbar^2}\, |V_{mn}(\omega_{mn})|^2 . \tag{11.66}$$


Hence, for times t > T, the transition probability $W_{mn}$ is constant and, as we see, is nonzero only if $V_{mn}(\omega_{mn}) \neq 0$. This means that the transition from the state $\psi_n$ (level $E_n$) to the state $\psi_m$ (level $E_m$) is only possible if the frequency $\omega_{mn} = (E_m - E_n)/\hbar$ is contained in the perturbation spectrum, i.e. in the Fourier spectrum $V_{mn}(\omega)$ of the perturbation [see (11.63)]. Thus the transition exhibits resonance behaviour.

Obviously, we have the same situation as with a system of oscillators with eigenfrequencies which are equal to the Bohr frequencies $\omega_{mn}$. If an external perturbation occurs that varies in time, then only those oscillators are stimulated which have an eigenfrequency that is included in the Fourier spectrum of the perturbation.
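The resonance behaviour can be made concrete with a small numerical experiment. The following Python sketch is an illustration only: it evaluates the first-order amplitude (11.59) with a midpoint rule for an assumed matrix element $V_{mn}(t) = V_0\cos(\Omega t)$ acting during 0 ≤ t ≤ T, in units with ħ = 1. The function names, the pulse shape and all parameter values are hypothetical choices and are not taken from the text.

```python
import cmath
import math

HBAR = 1.0  # assumption: units with hbar = 1


def first_order_probability(v_mn, omega_mn, T, steps=20000):
    """|a_m^(1)(T)|^2 from (11.59) for m != n, using a midpoint rule."""
    dt = T / steps
    integral = 0j
    for i in range(steps):
        t = (i + 0.5) * dt
        integral += v_mn(t) * cmath.exp(1j * omega_mn * t) * dt
    return abs(integral / (1j * HBAR)) ** 2


# Hypothetical perturbation: V_mn(t) = V0 * cos(DRIVE * t) for 0 <= t <= T.
V0, DRIVE, T = 0.01, 5.0, 20.0


def pulse(t):
    return V0 * math.cos(DRIVE * t)


on_resonance = first_order_probability(pulse, 5.0, T)  # Bohr frequency equals DRIVE
below = first_order_probability(pulse, 3.0, T)
above = first_order_probability(pulse, 7.0, T)
print(on_resonance, below, above)
```

With these assumed values the probability for a Bohr frequency equal to the driving frequency exceeds the off-resonant values by more than an order of magnitude, which is the oscillator picture described above.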


11.5 Time-Independent Perturbation 


If

$$V(\mathbf r, t) = \begin{cases} V(\mathbf r) & \text{for } 0 \le t \le T , \\ 0 & \text{otherwise,} \end{cases} \tag{11.67}$$

i.e. if the perturbation is not time dependent while acting in the period 0 ≤ t ≤ T (see Fig. 11.3), then the integrals can be easily evaluated, and we find from (11.59) that

Fig. 11.3. Perturbation constant in time over the period 0 ≤ t ≤ T

Fig. 11.4. The function f(t, ω) switches over to 2πtδ(ω) for t → ∞, i.e. the maximum at ω = 0 becomes increasingly sharp


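$$W_{mn}(t) = \frac{1}{\hbar^2}\, |V_{mn}|^2\, f(t, \omega_{mn}) \tag{11.68}$$

with

$$f(t, \omega) = \frac{1}{\omega^2}\left|e^{i\omega t} - 1\right|^2 = \frac{2}{\omega^2}\,(1 - \cos\omega t) . \tag{11.69}$$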


As a function of ω, the quantity f(t, ω) takes the form shown in Fig. 11.4. It has a well-defined peak at ω = 0 with the width 2π/t, which becomes more distinct and sharper with increasing t.

The following relations are even exact (compare with Exercise 11.10):

$$\int_{-\infty}^{\infty} f(t, \omega)\, d\omega = 2\pi t \tag{11.70}$$

and

$$\lim_{t\to\infty} f(t, \omega) = 2\pi t\, \delta(\omega) . \tag{11.71}$$
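The properties of f(t, ω) quoted in (11.69) and (11.70) can be checked numerically. The sketch below is only an illustration: the helper names, the frequency cutoff and the step count are assumptions made here. It integrates f over a finite frequency window with a midpoint rule, so the result falls slightly short of 2πt because the slowly decaying tails are cut off.

```python
import math


def f(t, omega):
    """f(t, omega) = (2 / omega^2)(1 - cos(omega t)), written as 4 sin^2(omega t / 2) / omega^2."""
    if abs(omega) < 1e-12:
        return t * t  # limiting value at omega = 0 (peak height)
    return 4.0 * math.sin(0.5 * omega * t) ** 2 / omega ** 2


def integrate_f(t, cutoff=500.0, steps=200000):
    """Midpoint-rule estimate of the integral of f over -cutoff <= omega <= cutoff."""
    d_omega = 2.0 * cutoff / steps
    total = 0.0
    for i in range(steps):
        omega = -cutoff + (i + 0.5) * d_omega
        total += f(t, omega) * d_omega
    return total


for t in (1.0, 2.0, 5.0):
    # columns: t, numerical integral, 2*pi*t, peak height t^2, value at the first zero 2*pi/t
    print(t, integrate_f(t), 2.0 * math.pi * t, f(t, 0.0), f(t, 2.0 * math.pi / t))
```

The peak height grows like t² while the first zero at ω = 2π/t moves towards the origin, so the area grows only linearly in t, in line with (11.70).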


For a fixed value of t, the probability $W_{mn}$ in (11.68) depends in a simple way on the final state m. Up to a constant, it is the square of the perturbation matrix element $|V_{mn}|^2$ multiplied by the factor $f(t, \omega_{mn})$, which depends on the Bohr frequency $\omega_{mn}$ of this transition. Since this weighting factor has a well-defined peak with the width 2π/t at $\omega_{mn} = 0$, transitions will mainly occur into states that have energies in a band of width δE ≈ 2πħ/t around the energy of the initial state. This means that the transitions conserve energy up to a value of the order δE ≈ 2πħ/t. For t → ∞ (and therefore T → ∞) there are no transitions. This is intuitively clear, because a perturbation which is constant for all times cannot induce a transition; it does not “shake” the system. It is not surprising that all frequencies occur here, because the Fourier transform of a function that is constant in time over a certain period contains all frequencies, except possibly those of a countable subset.
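To get a feeling for the range of validity of the first-order result (11.68), one can compare it with a case that is exactly solvable. The sketch below assumes a hypothetical two-level system (states 1 and 2, level spacing ħω, constant coupling $V_{12} = V_{21} = v$, vanishing diagonal elements) that starts in state 1; its exact transition probability is given by the well-known Rabi formula. The two-level model, the function names and the numbers are assumptions for illustration, in units with ħ = 1.

```python
import math

HBAR = 1.0  # assumption: units with hbar = 1


def f(t, omega):
    """Weighting factor (11.69): f = (2 / omega^2)(1 - cos(omega t))."""
    if abs(omega) < 1e-12:
        return t * t
    return 4.0 * math.sin(0.5 * omega * t) ** 2 / omega ** 2


def first_order(v, omega, t):
    """First-order transition probability (11.68): |V_mn|^2 f(t, omega_mn) / hbar^2."""
    return (v / HBAR) ** 2 * f(t, omega)


def exact_two_level(v, omega, t):
    """Rabi formula for the assumed two-level system with constant coupling v."""
    rabi = math.sqrt(omega ** 2 + (2.0 * v / HBAR) ** 2)
    return (2.0 * v / HBAR) ** 2 / rabi ** 2 * math.sin(0.5 * rabi * t) ** 2


omega, t = 2.0, 1.3
for v in (0.01, 0.1, 0.5):
    print(v, first_order(v, omega, t), exact_two_level(v, omega, t))
```

For weak coupling the two columns agree closely; for v comparable to ħω they differ noticeably, which reflects the assumption of a small perturbation made throughout this section.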
11.6 Transitions Between Continuum States


So far we have considered an “unperturbed” operator $\hat H_0(\mathbf r)$, which has only
a discrete spectrum. We have also used a formalism which presumes that the 
states are not degenerate. By suitably changing this formalism, we can of course 
apply it to degenerate states. The generalization for a continuous spectrum 
is somewhat more complicated, but very often of practical importance, for 
example: the ionization of atoms (transition from a discrete bound state to 
a continuum state — see Fig. 11.5a) as a consequence of the perturbation field of 
a charged particle that is passing by, or the bremsstrahlung (continuum- 
continuum transition — see Fig. 11.5b) of charged particles as a result of 
acceleration or deceleration in the field of other particles. Let us now discuss this 
problem from a general point of view. 

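If the operator $\hat H_0(\mathbf r)$ also has a continuous spectrum (see Fig. 11.5), we have as eigenfunctions

$$\hat H_0\psi_k(\mathbf r) = E_k\psi_k(\mathbf r) \quad\text{and}\quad \hat H_0\psi_\alpha(\mathbf r) = E(\alpha)\psi_\alpha(\mathbf r) . \tag{11.72}$$

Here, α is a continuous index that characterizes the continuum states of the spectrum. The stationary solutions, belonging to the time-dependent Schrödinger equation, are accordingly

$$\psi_k(\mathbf r, t) = \psi_k(\mathbf r)\exp\left(-\frac{i}{\hbar}E_k t\right) , \quad \psi_\alpha(\mathbf r, t) = \psi_\alpha(\mathbf r)\exp\left(-\frac{i}{\hbar}E(\alpha) t\right) . \tag{11.73}$$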


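For the normalization of the eigenfunctions of the discrete states, we again have

$$\int \psi_k^*(\mathbf r, t)\,\psi_{k'}(\mathbf r, t)\, d^3x = \delta_{kk'} . \tag{11.74}$$

For the overlap integrals between $\psi_k$ and $\psi_\alpha$ it holds that (because of the orthogonality of these states)

$$\int \psi_k^*(\mathbf r, t)\,\psi_\alpha(\mathbf r, t)\, d^3x = 0 . \tag{11.75}$$

The normalization of the wave functions characterizing the continuous spectrum, however, is given by

$$\int \psi_\alpha^*(\mathbf r, t)\,\psi_{\alpha'}(\mathbf r, t)\, d^3x = \frac{1}{n(\alpha)}\,\delta(\alpha - \alpha') , \tag{11.76}$$

where n(α) is a positive function of α. Obviously the functions $\tilde\psi_\alpha(\mathbf r, t) = \sqrt{n(\alpha)}\,\psi_\alpha(\mathbf r, t)$ are normalized to δ functions (see e.g. Chap. 5).

For the solution of the “disturbed” problem, we must use all eigenfunctions, i.e. the complete set, and so we obtain the linear combination

$$\psi(\mathbf r, t) = \sum_k a_k(t)\,\psi_k(\mathbf r, t) + \int a_\alpha(t)\,\psi_\alpha(\mathbf r, t)\, d\alpha . \tag{11.77}$$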
Inserting this into the Schrödinger equation

$$i\hbar\frac{\partial\psi}{\partial t} = \left(\hat H_0 + V(\mathbf r, t)\right)\psi \tag{11.78}$$
Fig. 11.5. a) Transition from a discrete to a continuum state during ionization (the panel shows the discrete bound states, the ionization limit and the continuum states); b) transition between two continuum states in bremsstrahlung. At the same time, a photon carrying the energy difference between the two states is emitted

Fig. 11.6. Spectrum with a discrete ($E_k$) and a continuous E(α) part
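yields

$$\begin{aligned} i\hbar\left(\sum_k \frac{da_k}{dt}\psi_k + \int \frac{da_\alpha(t)}{dt}\psi_\alpha\, d\alpha\right) &+ \sum_k a_k\, i\hbar\frac{\partial\psi_k}{\partial t} + \int a_\alpha(t)\, i\hbar\frac{\partial\psi_\alpha}{\partial t}\, d\alpha \\ &= \sum_k a_k\hat H_0\psi_k + \int a_\alpha(t)\hat H_0\psi_\alpha\, d\alpha + \sum_k a_k V\psi_k + \int a_\alpha(t) V\psi_\alpha\, d\alpha . \end{aligned} \tag{11.79}$$

Since $\psi_k(\mathbf r, t)$ and $\psi_\alpha(\mathbf r, t)$ solve the unperturbed time-dependent Schrödinger equation, the terms containing the time derivatives of the wave functions cancel against the terms containing $\hat H_0$, so that

$$i\hbar\left(\sum_k \frac{da_k}{dt}\psi_k + \int \frac{da_\alpha(t)}{dt}\psi_\alpha\, d\alpha\right) = \sum_k a_k V\psi_k + \int a_\alpha(t) V\psi_\alpha\, d\alpha \tag{11.80}$$

remains. Now, proceeding as above [see (11.43ff.)], we find, after multiplication by the wave function $\psi_{k'}^*$ or $\psi_{\alpha'}^*$, that

$$\begin{aligned} &i\hbar\sum_k \frac{da_k}{dt}\exp\left[\frac{i}{\hbar}(E_{k'} - E_k)t\right]\psi_{k'}^*\psi_k + i\hbar\int \frac{da_\alpha(t)}{dt}\exp\left[\frac{i}{\hbar}(E_{k'} - E(\alpha))t\right]\psi_{k'}^*\psi_\alpha\, d\alpha \\ &\quad = \sum_k a_k\exp\left[\frac{i}{\hbar}(E_{k'} - E_k)t\right]\psi_{k'}^* V\psi_k + \int a_\alpha(t)\exp\left[\frac{i}{\hbar}(E_{k'} - E(\alpha))t\right]\psi_{k'}^* V\psi_\alpha\, d\alpha , \end{aligned} \tag{11.81}$$

$$\begin{aligned} &i\hbar\sum_k \frac{da_k}{dt}\exp\left[\frac{i}{\hbar}(E(\alpha') - E_k)t\right]\psi_{\alpha'}^*\psi_k + i\hbar\int \frac{da_\alpha(t)}{dt}\exp\left[\frac{i}{\hbar}(E(\alpha') - E(\alpha))t\right]\psi_{\alpha'}^*\psi_\alpha\, d\alpha \\ &\quad = \sum_k a_k\exp\left[\frac{i}{\hbar}(E(\alpha') - E_k)t\right]\psi_{\alpha'}^* V\psi_k + \int a_\alpha(t)\exp\left[\frac{i}{\hbar}(E(\alpha') - E(\alpha))t\right]\psi_{\alpha'}^* V\psi_\alpha\, d\alpha , \end{aligned} \tag{11.82}$$

where the wave functions now denote the time-independent functions $\psi_k(\mathbf r)$ and $\psi_\alpha(\mathbf r)$.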


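Integration over the space coordinates, and taking into account the normalization and the overlap integrals, leads to

$$i\hbar\frac{da_{k'}}{dt} = \sum_k a_k\exp\left[\frac{i}{\hbar}(E_{k'} - E_k)t\right]\int d^3x\, \psi_{k'}^* V\psi_k + \int d\alpha\, a_\alpha(t)\exp\left[\frac{i}{\hbar}(E_{k'} - E(\alpha))t\right]\int d^3x\, \psi_{k'}^* V\psi_\alpha \tag{11.83}$$

and

$$i\hbar\frac{1}{n(\alpha')}\frac{da_{\alpha'}}{dt} = \sum_k a_k\exp\left[\frac{i}{\hbar}(E(\alpha') - E_k)t\right]\int d^3x\, \psi_{\alpha'}^* V\psi_k + \int d\alpha\, a_\alpha(t)\exp\left[\frac{i}{\hbar}(E(\alpha') - E(\alpha))t\right]\int d^3x\, \psi_{\alpha'}^* V\psi_\alpha . \tag{11.84}$$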


Now, it is advisable to use for the interaction matrix element the same symbol as in (11.46), where the indices σ and τ can stand for both the continuous and the discrete spectrum. Thus

$$V_{\sigma\tau}(t) = \int d^3x\, \psi_\sigma^*\, V(\mathbf r, t)\, \psi_\tau . \tag{11.85}$$

In the same way, we generalize the Bohr frequencies:

$$\omega_{\sigma\tau} = \frac{1}{\hbar}(E_\sigma - E_\tau) . \tag{11.86}$$


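Here, either $E_\sigma = E(\alpha)$ or $E_\sigma = E_k$ is possible, depending on which part of the spectrum the index σ corresponds to. Then the coupled system of differential equations takes the form

$$i\hbar\frac{da_k}{dt} = \sum_{k'} a_{k'}\, e^{i\omega_{kk'}t}\, V_{kk'}(t) + \int a_\alpha(t)\, e^{i\omega_{k\alpha}t}\, V_{k\alpha}(t)\, d\alpha , \tag{11.87a}$$

$$i\hbar\frac{1}{n(\alpha)}\frac{da_\alpha(t)}{dt} = \sum_k a_k\, e^{i\omega_{\alpha k}t}\, V_{\alpha k}(t) + \int a_{\alpha'}(t)\, e^{i\omega_{\alpha\alpha'}t}\, V_{\alpha\alpha'}(t)\, d\alpha' . \tag{11.87b}$$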


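We already know the meaning of the $a_k$ from above; for the $a_\alpha(t)$, the situation is slightly different. To determine their significance, we proceed as before in (11.51):

$$\begin{aligned} \int \psi^*\psi\, d^3x &= \sum_{k,k'} a_k^* a_{k'}\, e^{i\omega_{kk'}t}\int d^3x\, \psi_k^*\psi_{k'} + \sum_k\int d\alpha\left(a_k^* a_\alpha(t)\, e^{i\omega_{k\alpha}t}\int d^3x\, \psi_k^*\psi_\alpha + \text{c.c.}\right) \\ &\quad + \int d\alpha\int d\alpha'\, a_\alpha^*(t)\, a_{\alpha'}(t)\, e^{i\omega_{\alpha\alpha'}t}\int d^3x\, \psi_\alpha^*\psi_{\alpha'} \\ &= \sum_k |a_k|^2 + \int d\alpha\int d\alpha'\, a_\alpha^*(t)\, a_{\alpha'}(t)\, e^{i\omega_{\alpha\alpha'}t}\,\frac{1}{n(\alpha)}\,\delta(\alpha - \alpha') \\ &= \sum_k |a_k|^2 + \int d\alpha\,\frac{|a_\alpha(t)|^2}{n(\alpha)} = 1 . \end{aligned} \tag{11.88}$$

Just as we expected for wave functions belonging to the continuous spectrum, only the expression

$$\frac{1}{n(\alpha)}\,|a_\alpha(t)|^2\, d\alpha \tag{11.89}$$

has the meaning of a probability. More precisely, (11.89) gives the probability of finding the system in the range of states between α and α + dα.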
Projectors

We denote by B(α) a small, connected range of values of the parameter α (this corresponds to a group of “neighbouring” states; see Fig. 11.7), so that the operator

$$P_B = \int_{B(\alpha)} d\alpha\, \psi_\alpha(\mathbf r, t)\, n(\alpha)\, \psi_\alpha^*(\mathbf r', t) \tag{11.90}$$

represents the projector (projection operator) onto those states contained in the interval and characterized by the values of the parameter α within the range of values B(α).

Fig. 11.7. The domain B(α) of the continuous spectrum

The projector functions as follows:

$$P_B\psi = \int_{B(\alpha)} d\alpha\, \psi_\alpha(\mathbf r, t)\, n(\alpha)\int d^3x'\, \psi_\alpha^*(\mathbf r', t)\, \psi(\mathbf r', t) . \tag{11.91}$$

The action of $P_B$ on a wave function ψ(r', t) is defined by the integral over r' in (11.91). The projector is somewhat similar to the Weyl packet (eigendifferential; see Chap. 5).

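Now, if ψ is any wave function, we find from the expansion

$$\begin{aligned} \psi(\mathbf r', t) &= \sum_k a_k\psi_k(\mathbf r', t) + \int d\alpha'\, a_{\alpha'}\psi_{\alpha'}(\mathbf r', t) \\ &= \sum_k a_k\exp\left(-\frac{i}{\hbar}E_k t\right)\psi_k(\mathbf r') + \int d\alpha'\, a_{\alpha'}\exp\left(-\frac{i}{\hbar}E(\alpha')t\right)\psi_{\alpha'}(\mathbf r') \end{aligned} \tag{11.92}$$

that

$$\begin{aligned} P_B\psi &= \int_{B(\alpha)} d\alpha\, \psi_\alpha(\mathbf r, t)\, n(\alpha)\left\{\sum_k a_k\exp\left[\frac{i}{\hbar}(E(\alpha) - E_k)t\right]\int d^3x'\, \psi_\alpha^*(\mathbf r')\psi_k(\mathbf r') \right. \\ &\qquad\left. + \int d\alpha'\, a_{\alpha'}\exp\left[\frac{i}{\hbar}(E(\alpha) - E(\alpha'))t\right]\int d^3x'\, \psi_\alpha^*(\mathbf r')\psi_{\alpha'}(\mathbf r')\right\} \\ &= \int_{B(\alpha)} d\alpha\, \psi_\alpha(\mathbf r, t)\, n(\alpha)\int d\alpha'\, a_{\alpha'}\exp\left[\frac{i}{\hbar}(E(\alpha) - E(\alpha'))t\right]\frac{1}{n(\alpha)}\,\delta(\alpha - \alpha') \\ &= \int_{B(\alpha)} d\alpha\, a_\alpha(t)\,\psi_\alpha(\mathbf r, t) \end{aligned} \tag{11.93}$$

holds. The application of the operator $P_B$ thus causes the projection of the wave function onto that domain of states $\psi_\alpha$ which is characterized by values α within the interval B(α). This explains the name projector. If we now consider the energy E(α) as a new variable (i.e. we transform from α to the energy E), and if we name the corresponding range of values B(E), we can also write the projector as

$$P_B = \int_{B(E)} dE\, \psi_{\alpha(E)}(\mathbf r, t)\, \varrho_\alpha(E)\, \psi_{\alpha(E)}^*(\mathbf r', t) , \tag{11.94}$$

with

$$\varrho_\alpha(E) = n(\alpha)\,\frac{d\alpha}{dE} . \tag{11.95}$$

This quantity $\varrho_\alpha(E)$ is called the density of states α at the energy E. By inspection of the operator (11.94), we see that, indeed, $\varrho_\alpha(E)$ is the number of states per unit of energy. We shall now examine two properties of $P_B$.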
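a) Idempotency: $P_B^2 = P_B$

Indeed we find with $\psi = \sum_k a_k\psi_k + \int d\alpha\, a_\alpha(t)\psi_\alpha$ that

$$\begin{aligned} P_B^2\psi &= P_B(P_B\psi) \\ &= \int_{B(\alpha')} d\alpha'\, \psi_{\alpha'}(\mathbf r, t)\, n(\alpha')\int d^3x'\, \psi_{\alpha'}^*(\mathbf r', t)\int_{B(\alpha)} d\alpha\, a_\alpha(t)\,\psi_\alpha(\mathbf r', t) \\ &= \int_{B(\alpha')} d\alpha'\, \psi_{\alpha'}(\mathbf r, t)\, n(\alpha')\int_{B(\alpha)} d\alpha\, a_\alpha(t)\underbrace{\int d^3x'\, \psi_{\alpha'}^*(\mathbf r', t)\,\psi_\alpha(\mathbf r', t)}_{[1/n(\alpha)]\,\delta(\alpha - \alpha')} \\ &= \int_{B(\alpha)} d\alpha\, a_\alpha(t)\,\psi_\alpha(\mathbf r, t) = P_B\psi . \end{aligned} \tag{11.96}$$

Since this relation must be true for any ψ, we can conclude that the idempotency relation $P_B^2 = P_B$ holds generally.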
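b) Hermiticity: $P_B^\dagger = P_B$

If $\phi = \sum_k b_k\psi_k + \int d\alpha\, b_\alpha(t)\psi_\alpha$, then it follows that

$$P_B\phi = \int_{B(\alpha)} d\alpha\, b_\alpha(t)\,\psi_\alpha(\mathbf r, t) . \tag{11.97}$$

We investigate the action of the operator $P_B^\dagger$ for arbitrary φ and ψ by considering the integral

$$\begin{aligned} \int d^3x\, \phi^* P_B^\dagger\psi &= \int d^3x\, (P_B\phi)^*\psi \\ &= \int d^3x\left(\int_{B(\alpha)} d\alpha\, b_\alpha(t)\,\psi_\alpha(\mathbf r, t)\right)^*\left(\sum_k a_k\psi_k + \int d\alpha'\, a_{\alpha'}(t)\,\psi_{\alpha'}\right) \\ &= \int_{B(\alpha)} d\alpha\, b_\alpha^*(t)\left(\sum_k a_k\underbrace{\int d^3x\, \psi_\alpha^*\psi_k}_{=0} + \int d\alpha'\, a_{\alpha'}(t)\underbrace{\int d^3x\, \psi_\alpha^*\psi_{\alpha'}}_{[1/n(\alpha)]\,\delta(\alpha - \alpha')}\right) \\ &= \int_{B(\alpha)} d\alpha\, b_\alpha^*(t)\int d\alpha'\, a_{\alpha'}(t)\,\frac{1}{n(\alpha)}\,\delta(\alpha - \alpha') \\ &= \int_{B(\alpha)} d\alpha\, b_\alpha^*(t)\, a_\alpha(t)\,\frac{1}{n(\alpha)} . \end{aligned} \tag{11.98}$$

On the other hand, we calculate

$$\begin{aligned} \int d^3x\, \phi^* P_B\psi &= \int d^3x\left(\sum_k b_k^*\psi_k^* + \int d\alpha'\, b_{\alpha'}^*(t)\,\psi_{\alpha'}^*\right)\int_{B(\alpha)} d\alpha\, a_\alpha(t)\,\psi_\alpha(\mathbf r, t) \\ &= \int_{B(\alpha)} d\alpha\, a_\alpha(t)\left(\sum_k b_k^*\underbrace{\int d^3x\, \psi_k^*\psi_\alpha}_{=0} + \int d\alpha'\, b_{\alpha'}^*(t)\underbrace{\int d^3x\, \psi_{\alpha'}^*\psi_\alpha}_{[1/n(\alpha)]\,\delta(\alpha - \alpha')}\right) \\ &= \int_{B(\alpha)} d\alpha\, a_\alpha(t)\int d\alpha'\, b_{\alpha'}^*(t)\,\frac{1}{n(\alpha)}\,\delta(\alpha - \alpha') \\ &= \int_{B(\alpha)} d\alpha\, b_\alpha^*(t)\, a_\alpha(t)\,\frac{1}{n(\alpha)} = \int d^3x\, \phi^* P_B^\dagger\psi . \end{aligned} \tag{11.99}$$

As this relation is valid for any φ and ψ, it follows that $P_B^\dagger = P_B$. This means: $P_B$ is Hermitian.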
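Both properties can be illustrated with a finite-dimensional analogue in which the integrals over α are replaced by sums over a few basis vectors. The sketch below is an assumption-laden toy model, not the continuum case itself: it uses three hypothetical orthogonal but non-normalized real vectors $v_j$ with $\langle v_j|v_k\rangle = \delta_{jk}/n_j$, so that the weights $n_j$ play the role of n(α), and it builds the projector $P = \sum_{j\in B} |v_j\rangle\, n_j\, \langle v_j|$ in analogy to (11.90).

```python
def outer_scaled(vec, weight):
    """Matrix weight * |vec><vec| for a real vector."""
    return [[weight * a * b for b in vec] for a in vec]


def add(m1, m2):
    return [[x + y for x, y in zip(r1, r2)] for r1, r2 in zip(m1, m2)]


def matmul(m1, m2):
    cols = list(zip(*m2))
    return [[sum(a * b for a, b in zip(row, col)) for col in cols] for row in m1]


def transpose(m):
    return [list(col) for col in zip(*m)]


# Hypothetical orthogonal, non-normalized basis with <v_j|v_k> = delta_jk / n_j.
basis = [(0.5, 0.5, 0.0), (0.5, -0.5, 0.0), (0.0, 0.0, 0.25)]
weights = [1.0 / sum(c * c for c in v) for v in basis]  # n_j = 1 / <v_j|v_j>

# Projector onto the "range" B = {v_0, v_2}: P = sum_{j in B} |v_j> n_j <v_j|.
selected = (0, 2)
P = [[0.0] * 3 for _ in range(3)]
for j in selected:
    P = add(P, outer_scaled(basis[j], weights[j]))

print(P)             # the projector
print(matmul(P, P))  # equals P (idempotency)
print(transpose(P))  # equals P (Hermiticity for a real matrix)
```

The weights $n_j$ are essential here: without them the sum of outer products of non-normalized vectors would not be idempotent, which mirrors the role of n(α) in (11.90).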

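Now we can return to the calculation of transition probabilities. Because $P_B\psi$ yields that part of the wave function that lies in the range of neighbouring states $\psi_\alpha$ within the interval characterized by B(α), the integral $\int |P_B\psi|^2 d^3x$ is just the probability $W_B$ of finding the system in the states within B(α):

$$\begin{aligned} W_B &= \int |P_B\psi|^2\, d^3x = \int d^3x\, (P_B\psi)^* P_B\psi = \int d^3x\, \psi^* P_B^\dagger P_B\psi = \int d^3x\, \psi^* P_B\psi \\ &= \int d^3x\left(\sum_k a_k^*\psi_k^* + \int d\alpha'\, a_{\alpha'}^*(t)\,\psi_{\alpha'}^*\right)\int_{B(\alpha)} d\alpha\, a_\alpha(t)\,\psi_\alpha(\mathbf r, t) \\ &= \int d\alpha'\, a_{\alpha'}^*(t)\int_{B(\alpha)} d\alpha\, a_\alpha(t)\,\frac{1}{n(\alpha)}\,\delta(\alpha - \alpha') \\ &= \int_{B(\alpha)} \frac{|a_\alpha(t)|^2}{n(\alpha)}\, d\alpha = \int_{B(E)} \left|\frac{a_\alpha(t)}{n(\alpha)}\right|^2\varrho_\alpha(E)\, dE . \end{aligned} \tag{11.100}$$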


It is now easy to determine the transition probability if the system is initially in a state n of the discrete spectrum, i.e. $a_k(0) = \delta_{kn}$, $a_\alpha(0) = 0$. Hence, in first order, we get from (11.87):

$$\frac{da_m^{(1)}}{dt} = \frac{1}{i\hbar}\, V_{mn}(t)\, e^{i\omega_{mn}t} , \qquad \frac{1}{n(\alpha)}\frac{da_\alpha^{(1)}}{dt} = \frac{1}{i\hbar}\, V_{\alpha n}(t)\, e^{i\omega_{\alpha n}t} . \tag{11.101}$$

For the discrete part of the spectrum, there is no change compared with the earlier result [see (11.57)]. Therefore we shall further examine here only the second part, which describes transitions into the continuous spectrum. (Such problems arise, for example, in the case of ionization, when particles that pass by perturb the system.) After integration of (11.101) we find that

$$\frac{a_\alpha^{(1)}(t)}{n(\alpha)} = \frac{1}{i\hbar}\int_0^t V_{\alpha n}(t')\, e^{i\omega_{\alpha n}t'}\, dt' \tag{11.102}$$

and

$$W_{n\to B(E)} = \int_{B(E)} \left|\frac{a_\alpha^{(1)}(t)}{n(\alpha)}\right|^2\varrho_\alpha(E)\, dE = \frac{1}{\hbar^2}\int_{B(E)}\left|\int_0^t V_{\alpha n}(t')\, e^{i\omega_{\alpha n}t'}\, dt'\right|^2\varrho_\alpha(E)\, dE , \tag{11.103}$$

because if the transition occurs out of a particular state, then the transition probability is just the probability of finding the system in states $\psi_{\alpha(E)}$ of the range B(E) at a later time. Note that α = α(E) in (11.103)!

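If the initial state (characterized by α = β) belongs to the continuum, then $a_k(0) = 0$ and $a_\alpha(0) = \delta(\beta - \alpha)$. The result is completely analogous, namely

$$\frac{a_\alpha^{(1)}(t)}{n(\alpha)} = \frac{1}{i\hbar}\int_0^t V_{\alpha\beta}(t')\, e^{i\omega_{\alpha\beta}t'}\, dt' . \tag{11.104}$$

This leads to the transition probability

$$W_{\beta\to B(E)} = \frac{1}{\hbar^2}\int_{B(E)}\left|\int_0^t V_{\alpha\beta}(t')\, e^{i\omega_{\alpha\beta}t'}\, dt'\right|^2\varrho_\alpha(E)\, dE . \tag{11.105}$$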


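If the perturbation is constant in the interval 0 ≤ t ≤ T, i.e.

$$V(\mathbf r, t) = \begin{cases} V(\mathbf r) & \text{for } 0 \le t \le T , \\ 0 & \text{otherwise,} \end{cases}$$

then

$$V_{\alpha\beta}(t) = \int d^3x\, \psi_\alpha^*\, V(\mathbf r)\, \psi_\beta = V_{\alpha\beta} , \tag{11.106}$$

so that

$$W_{\beta\to B(E)} = \frac{1}{\hbar^2}\int_{B(E)} |V_{\alpha\beta}|^2\, f(t, \omega_{\alpha\beta})\, \varrho_\alpha(E)\, dE , \tag{11.107}$$

where again [see (11.69)]

$$f(t, \omega) = \left|\int_0^t e^{i\omega t'}\, dt'\right|^2 = \frac{2}{\omega^2}\,(1 - \cos\omega t) . \tag{11.108}$$

In (11.107), the squared matrix element $|V_{\alpha\beta}|^2$ cannot be taken out of the integral because it is related to the integration variable by α(E). Otherwise (11.107) is totally analogous to the earlier result (11.68), except that here the density of the final states $\varrho_\alpha(E)$ appears. If these final states are concentrated around a state with the energy $E_f$, i.e. $\varrho_\alpha(E) = \delta(E - E_f)$, then (11.68) can be immediately deduced from (11.107).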
EXAMPLE

11.7 Transition Probability per Unit Time — Fermi’s Golden Rule 
Here we wish to examine the transition to states $\psi_\alpha$ within the energy interval $[E_1 - \varepsilon/2,\ E_1 + \varepsilon/2]$. The width ε is chosen so small that in this interval $V_{\alpha\beta}$ and $\varrho_\alpha(E)$ can practically be considered to be constant, i.e. independent of α or E. Then we can write for the transition probability (11.107):

$$W_{\beta\to B(E)} = \frac{1}{\hbar^2}\, |V_{\alpha\beta}|^2\, \varrho_\alpha(E_1)\int_{B(E)} f(t, \omega_{\alpha\beta})\, dE , \tag{1}$$

where now $B(E) = \{E \mid E_1 - \varepsilon/2 < E < E_1 + \varepsilon/2\}$.

If the time is chosen large enough, so that ε gets much larger than the oscillation period of the function f(t, ω) on the energy scale, i.e. so that

$$\varepsilon \gg \frac{2\pi\hbar}{t} \tag{2}$$

[compare with Fig. 11.4 for f(t, ω)], then the remaining integral in $W_{\beta\to B(E)}$ can be evaluated rather straightforwardly. Yet, two cases must be distinguished.

a) The central peak of f(t, ω) lies outside the integration interval (see figure), i.e.

$$E_1 - E_\beta \gg \varepsilon \gg \frac{2\pi\hbar}{t} . \tag{3}$$

[Figure: In case a), transitions from the state $\psi_\beta$ with the energy $E_\beta$ into the continuum states $\psi_\alpha$ in B(E) around $E_1$ are considered.]

Obviously, in this case, the system changes its energy under the influence of the perturbation (no conservation of energy). With

$$f(t, \omega) = \frac{2}{\omega^2}\,(1 - \cos\omega t)$$

we obtain

$$\int_{B(E)} f(t, \omega_{\alpha\beta})\, dE = 2\hbar^2\int_{E_1 - \varepsilon/2}^{E_1 + \varepsilon/2}\frac{dE}{(E - E_\beta)^2} - 2\hbar\int_{(E_1 - \varepsilon/2 - E_\beta)/\hbar}^{(E_1 + \varepsilon/2 - E_\beta)/\hbar}\frac{\cos\omega t}{\omega^2}\, d\omega . \tag{4}$$

The second integral makes a negligible contribution because, according to the assumption (2) for ε, the integration interval contains many oscillations and cos ωt repeatedly takes on all values between −1 and 1. Therefore

$$\int_{B(E)} f(t, \omega_{\alpha\beta})\, dE \approx 2\hbar^2\int_{E_1 - \varepsilon/2}^{E_1 + \varepsilon/2}\frac{dE}{(E - E_\beta)^2} = -\frac{2\hbar^2}{E - E_\beta}\Bigg|_{E_1 - \varepsilon/2}^{E_1 + \varepsilon/2} \approx \frac{2\hbar^2\varepsilon}{(E_1 - E_\beta)^2} , \tag{5}$$

and the time-independent expression for the transition probability results:

$$W_{\beta\to B(E)} = \frac{2\varepsilon\, |V_{\alpha\beta}|^2}{(E_1 - E_\beta)^2}\,\varrho_\alpha(E_1) . \tag{6}$$
We now consider the case

$$\text{b)}\quad E_1 - E_\beta \ll \varepsilon \gg \frac{2\pi\hbar}{t} . \tag{7}$$

Here, the central peak lies within the integration interval, and the main contribution to the integral comes from that part (see figure below). We make only a small error by moving the boundaries of the integration to ±∞, i.e.

$$\int_{B(E)} f(t, \omega_{\alpha\beta})\, dE \approx 2\hbar\int_{-\infty}^{\infty}\frac{1 - \cos\omega t}{\omega^2}\, d\omega . \tag{8}$$

The value of the integral is determined according to the residue theorem:

$$2\int_{-\infty}^{\infty}\frac{1 - \cos\omega t}{\omega^2}\, d\omega = 2\pi t , \quad\text{thus}\quad \int_{B(E)} f(t, \omega_{\alpha\beta})\, dE = 2\pi\hbar t , \tag{9}$$

and with this

$$W_{\beta\to B(E)} = \frac{2\pi}{\hbar}\, |V_{\alpha\beta}(E)|^2\, \varrho_\alpha(E)\, t \tag{10}$$

results. Here, we have written E as the argument, because the main contribution results from ω = 0, i.e. for $E_\alpha = E_\beta = E$. Since the final state has the same energy as the initial state, this transition conserves energy. Furthermore, because ε ≫ 2πħ/t, the transition probability is larger than the sum of all other parts.

Now we introduce the transition probability per unit time $w_{\beta\to B(E)}$, which is naturally defined by

$$w_{\beta\to B(E)} = \frac{dW_{\beta\to B(E)}}{dt} . \tag{11}$$

Since it was found in (6) that $W_{\beta\to B(E)}$ is time-independent for those transitions which do not conserve energy, $w_{\beta\to B(E)}$ vanishes for them. For energy-conserving transitions, however, we find the important formula

$$w_{\beta\to B(E)} = \frac{2\pi}{\hbar}\, |V_{\alpha\beta}(E)|^2\, \varrho_\alpha(E) . \tag{12}$$

It should be emphasized again that the matrix elements which appear and the density of states are assigned to states α which have the same energy $E_\beta$ as the initial state. The expression (12) is called Fermi’s Golden Rule.

The conditions for the validity of this formula are evident from its derivation. Let us recapitulate the two most essential assumptions once more. It is necessary for the time t to be long enough to guarantee that ε ≫ 2πħ/t; on the other hand, the time t must be short enough to justify the first-order perturbation theory approximation, i.e. $w_{\beta\to B(E)}\, t \ll 1$.
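The two cases distinguished in this example can be illustrated numerically. The following sketch is an illustration under stated assumptions: it sets ħ = 1 and $E_\beta = 0$, uses hypothetical window widths and times chosen so that conditions (3) and (7) are reasonably satisfied, and evaluates the integral of f over the window B(E) with a midpoint rule. The function names are introduced here and do not appear in the text.

```python
import math

HBAR = 1.0  # assumption: units with hbar = 1


def f(t, omega):
    """f(t, omega) = (2 / omega^2)(1 - cos(omega t))."""
    if abs(omega) < 1e-12:
        return t * t
    return 4.0 * math.sin(0.5 * omega * t) ** 2 / omega ** 2


def window_integral(t, e_center, eps, e_beta=0.0, steps=40000):
    """Midpoint estimate of the integral of f(t, (E - E_beta)/hbar) dE over B(E)."""
    d_e = eps / steps
    total = 0.0
    for i in range(steps):
        energy = e_center - 0.5 * eps + (i + 0.5) * d_e
        total += f(t, (energy - e_beta) / HBAR) * d_e
    return total


# Case b): the peak at E = E_beta lies inside the window; compare with 2 pi hbar t, see (9).
for t in (25.0, 50.0):
    print("b", t, window_integral(t, 0.0, 20.0), 2.0 * math.pi * HBAR * t)

# Case a): the window lies far from the peak; compare with 2 hbar^2 eps / (E_1 - E_beta)^2, see (5).
for t in (25.0, 50.0):
    print("a", t, window_integral(t, 100.0, 10.0), 2.0 * HBAR ** 2 * 10.0 / 100.0 ** 2)
```

In case b) the integral doubles when t doubles, which is what produces a constant rate in (12); in case a) it stays essentially unchanged, so the corresponding rate vanishes.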


EXAMPLE


11.8 Elastic Scattering of an Electron by an Atomic Nucleus 


In this example, we give an application of Fermi’s Golden Rule and simultan- 
eously illustrate some concepts of scattering theory. 
[Figure for Example 11.7: In case b), $E_\beta$ lies within the range B(E).]

[Figure: An electron is scattered by an atomic nucleus.]


We examine the scattering of a high-velocity electron by a nucleus with the charge number Z. The electron-nucleus interaction (in our case the Coulomb energy) is treated as a perturbation:

$$V(\mathbf r, \mathbf R) = \frac{Ze^2}{|\mathbf r - \mathbf R|}\exp\left(-\frac{|\mathbf r - \mathbf R|}{d}\right) . \tag{1}$$


Let r mark the coordinates of the electron, R those of the nucleus. The 
exponential factor in (1) is introduced because of the screening effect of the 
electron cloud on the nuclear charge. We need it for mathematical reasons, too, 
since it helps to avoid divergencies when integrations are performed. The length 
d is a measure for the shielding distance. For |r — R| > d, the interaction 
disappears, because the charge of the nucleus is then completely shielded by the 
bound electrons. 

Before the scattering, the whole system is described by the state $|\psi_i\rangle$ and afterwards by the state $|\psi_f\rangle$. We want to calculate the probability for the transition $|\psi_i\rangle \to |\psi_f\rangle$ in order to learn something about the structure (charge distribution) of the nucleus when comparing it with experiments. The electron is in a state of motion with momentum $\mathbf p_0$ and energy $E_0$ before scattering occurs and is scattered by the Coulomb field (1) into a state with $\mathbf p$ and E. Since these are continuum states, the transition probability per unit time is given by (12) of Example 11.7:

$$w = \frac{2\pi}{\hbar}\, |\langle\psi_f|V|\psi_i\rangle|^2\, \varrho(E) . \tag{2}$$


The total wave functions of the electron-nucleus system are products of the wave 
function of the electron and that of the nucleus. We use plane waves as an 
approximation for the wave function of the electron. This approximation is called 
the Born approximation. But it is only valid if the electron-nucleus interaction 
is small, i.e. the nuclear charge should not be too large and the velocity of the
electron has to be great enough. These conditions are summarized by the 
relation 

$$\frac{Ze^2}{\hbar v} \ll 1 . \tag{3}$$


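Thus the wave functions are

$$\psi_i = \exp\left(\frac{i}{\hbar}\mathbf p_0\cdot\mathbf r\right)\phi_i(\mathbf R) , \qquad \psi_f = \exp\left(\frac{i}{\hbar}\mathbf p\cdot\mathbf r\right)\phi_f(\mathbf R) , \tag{4}$$

or, written in another way,

$$|\psi_i\rangle = |\mathbf k_0\rangle|i\rangle , \qquad |\psi_f\rangle = |\mathbf k\rangle|f\rangle . \tag{5}$$

This can also be expressed as

$$\begin{aligned} \langle\mathbf r|\mathbf k_0\rangle &= \exp(i\mathbf k_0\cdot\mathbf r) = \exp\left(\frac{i}{\hbar}\mathbf p_0\cdot\mathbf r\right) , & \langle\mathbf r|\mathbf k\rangle &= \exp(i\mathbf k\cdot\mathbf r) = \exp\left(\frac{i}{\hbar}\mathbf p\cdot\mathbf r\right) , \\ \langle\mathbf R|i\rangle &= \phi_i(\mathbf R) , & \langle\mathbf R|f\rangle &= \phi_f(\mathbf R) , \end{aligned} \tag{6}$$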
a form of notation used by some authors; we will also use it occasionally (see, e.g. Chap. 15).

The φ(R) are the normalized wave functions of the nucleus. We did not normalize the plane waves (4), in view of (11.76) and (11.95), in order to define the density of the plane waves in a general way (i.e. also for plane waves which are not normalized to δ functions).

The ket vectors $|\mathbf k\rangle$ form a complete set of orthogonal functions:

$$\langle\mathbf k|\mathbf k'\rangle = (2\pi)^3\,\delta(\mathbf k - \mathbf k') , \qquad \int |\mathbf k\rangle\,\frac{d^3k}{(2\pi)^3}\,\langle\mathbf k| = 1 . \tag{7}$$


These formulae will be necessary when calculating the density of states. But let us now evaluate the matrix elements. Since the wave function factorizes into an electronic and a nuclear part, the volume element dV also implies integration over both volumes $dV_e$ (electron space) and $dV_n$ (nucleus space). Thus

$$\langle\psi_f|V|\psi_i\rangle = Ze^2\int \phi_f^*(\mathbf R)\,\phi_i(\mathbf R)\left[\int e^{i(\mathbf k_0 - \mathbf k)\cdot\mathbf r}\,\frac{\exp(-|\mathbf r - \mathbf R|/d)}{|\mathbf r - \mathbf R|}\, dV_e\right] dV_n . \tag{8}$$

The indices e and n stand for “electron” and “nucleus”, respectively. We first calculate the integral over the electronic coordinates and abbreviate $\mathbf k_0 - \mathbf k$ by the vector $\mathbf s$ (see next figure):

$$\mathbf s = \mathbf k_0 - \mathbf k . \tag{9}$$

Therefore $\hbar\mathbf s = \mathbf p_0 - \mathbf p$ is the momentum transfer from the electron to the nucleus during the scattering process (see figure). The integral over $dV_e$ is evidently a function of $\mathbf R$:

$$J(\mathbf R) = \int e^{i\mathbf s\cdot\mathbf r}\,\frac{\exp(-|\mathbf r - \mathbf R|/d)}{|\mathbf r - \mathbf R|}\, dV_e . \tag{10}$$


Since we integrate over the whole space, we can change our system of coordinates and integrate over the whole space again (the integration boundaries need not be changed). We replace $\mathbf r$ by $\mathbf r' = \mathbf r - \mathbf R$ and introduce spherical coordinates with ϑ being the angle between $\mathbf s$ and $\mathbf r'$ (see figure below).

In terms of these coordinates we have

$$dV_e = r'^2\, dr'\, \sin\vartheta\, d\vartheta\, d\varphi \tag{11}$$

and, from $\mathbf r = \mathbf r' + \mathbf R$, we get

$$\mathbf s\cdot\mathbf r = \mathbf s\cdot\mathbf r' + \mathbf s\cdot\mathbf R = s r'\cos\vartheta + \mathbf s\cdot\mathbf R . \tag{12}$$

[Figure: The vector $\hbar\mathbf s$ describes the momentum transferred from the electron to the nucleus. $\mathbf p_0 = \hbar\mathbf k_0$ and $\mathbf p = \hbar\mathbf k$ are the momenta of the electron before and after the collision.]

[Figure: The new coordinates $\mathbf r'$. The vector $\mathbf s$ is taken as polar axis.]
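Thus the integral becomes

$$\begin{aligned} J(\mathbf R) &= e^{i\mathbf s\cdot\mathbf R}\int e^{isr'\cos\vartheta}\,\frac{e^{-r'/d}}{r'}\, r'^2\sin\vartheta\, dr'\, d\vartheta\, d\varphi \\ &= 2\pi\, e^{i\mathbf s\cdot\mathbf R}\int_0^\infty\int_0^\pi e^{isr'\cos\vartheta}\, e^{-r'/d}\, r'\sin\vartheta\, d\vartheta\, dr' \\ &= \frac{2\pi}{is}\, e^{i\mathbf s\cdot\mathbf R}\int_0^\infty\left(e^{isr'} - e^{-isr'}\right) e^{-r'/d}\, dr' \\ &= \frac{2\pi}{is}\, e^{i\mathbf s\cdot\mathbf R}\left[\frac{e^{(is - 1/d)r'}}{is - 1/d} + \frac{e^{-(is + 1/d)r'}}{is + 1/d}\right]_0^\infty \\ &= -\frac{2\pi}{is}\, e^{i\mathbf s\cdot\mathbf R}\left(\frac{1}{is - 1/d} + \frac{1}{is + 1/d}\right) \\ &= \frac{4\pi}{s^2 + 1/d^2}\, e^{i\mathbf s\cdot\mathbf R} . \end{aligned} \tag{13}$$

This result can be simplified: the term 1/d² in the denominator can be neglected if s²d² ≫ 1. This means that the momentum transfer must not become too small. Hence,

$$J(\mathbf R) \approx \frac{4\pi}{s^2}\, e^{i\mathbf s\cdot\mathbf R} . \tag{14}$$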


Therefore the matrix element takes the form

$$\langle\psi_f|V|\psi_i\rangle = \frac{4\pi Ze^2}{s^2}\int \phi_f^*(\mathbf R)\, e^{i\mathbf s\cdot\mathbf R}\, \phi_i(\mathbf R)\, dV_n . \tag{15}$$

For elastic scattering, the state of the nucleus is not changed. Let the nucleus be in its ground state φ. Then we have $\phi_i = \phi_f = \phi$, and the product $Z\phi^*\phi(\mathbf R)$ is the density distribution of the protons in the nucleus. Instead of the wave functions, we can introduce the charge density $\varrho_p(\mathbf R)$ of the atomic nucleus (more accurately: $\varrho_p(\mathbf R)$ is the charge density without the factor e, which we have explicitly taken out):

$$Z\phi^*\phi = \varrho_p(\mathbf R) \quad\text{with}\quad \int \varrho_p(\mathbf R)\, dV_n = Z . \tag{16}$$

For further simplification we assume spherical symmetry of the charge distribution:

$$\varrho_p(\mathbf R) = \varrho_p(R) . \tag{17}$$


This assumption is only valid for atomic nuclei in the vicinity of the magic numbers. The others are prolately (cigar-like) deformed. Therefore we obtain for the matrix element

$$\langle\psi_f|V|\psi_i\rangle = \frac{4\pi e^2}{s^2}\int \varrho_p(R)\, e^{i\mathbf s\cdot\mathbf R}\, dV_n = \frac{4\pi e^2}{s^2}\, F(s) . \tag{18}$$

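The quantity F(s) is called a form factor. It is the Fourier-transformed charge distribution and reflects the deviation of the nuclear charge distribution from point structure. Indeed, if the nucleus is assumed to be pointlike (i.e. $\varrho_p(\mathbf R) = \delta(\mathbf R)$), we get F = 1.

The form factor $F(s) = \int \varrho_p(R)\, e^{i\mathbf s\cdot\mathbf R}\, dV_n$ can be further evaluated by again introducing spherical coordinates and using the axis defined by $\mathbf s$ as the polar axis. Hence, we get

$$dV_n = R^2\sin\vartheta\, dR\, d\vartheta\, d\varphi \quad\text{and}\quad \mathbf s\cdot\mathbf R = Rs\cos\vartheta , \tag{19}$$

and therefore

$$\begin{aligned} F(s) &= 2\pi\int_0^\infty\int_0^\pi \varrho_p(R)\, e^{isR\cos\vartheta}\, R^2\sin\vartheta\, d\vartheta\, dR \\ &= 2\pi\int_0^\infty \varrho_p(R)\left(\int_{-1}^{1} e^{isR\cos\vartheta}\, d(\cos\vartheta)\right) R^2\, dR \\ &= 2\pi\int_0^\infty \varrho_p(R)\,\frac{1}{isR}\left(e^{isR} - e^{-isR}\right) R^2\, dR \\ &= \frac{4\pi}{s}\int_0^\infty \varrho_p(R)\,\sin(sR)\, R\, dR . \end{aligned} \tag{20}$$

The last integral can be calculated only if the charge distribution $\varrho_p(R)$ is known. We will consider this point later (see Exercise 11.9). Our present result can be summarized as

$$\langle\psi_f|V|\psi_i\rangle = \frac{4\pi e^2}{s^2}\, F(s) , \tag{21}$$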


and we can turn to the calculation of the density of states. The orthogonality and closure relation (7) implies that n(α) equals $(2\pi)^{-3}$ if α is identified with $\mathbf k$. In other words, in the space of the vectors $\mathbf k$, the density of states is constant and equal to $(2\pi)^{-3}$, i.e. the number of states in the interval $[\mathbf k, \mathbf k + d\mathbf k]$ is equal to $(2\pi)^{-3}d^3k$. If we had used plane waves normalized to δ functions from the beginning, the density would have been $n(\mathbf k) = 1$.

Now we are interested in states with momenta pointing in a certain direction Ω. These momenta differ from each other in their energy only. Therefore the density of these states is ϱ(Ω, E), i.e. ϱ(Ω, E) dΩ dE is equal to the number of states which have a momentum pointing into the solid angle [Ω, Ω + dΩ] and whose energy lies in the interval [E, E + dE]. Thus we get

$$\varrho(\Omega, E)\, d\Omega\, dE = \frac{d^3k}{(2\pi)^3} . \tag{22}$$

By introducing spherical coordinates in the $\mathbf k$ space,

$$d^3k = k^2\, dk\, d\Omega , \tag{23}$$

we get

$$\varrho(\Omega, E)\, d\Omega\, dE = \frac{k^2\, dk}{(2\pi)^3}\, d\Omega . \tag{24}$$

We note that the density of states is independent of Ω, which allows us finally to write ϱ(Ω, E) = ϱ(E). Substituting the momentum p for k, the density of states ϱ(E) becomes

$$\varrho(E) = \frac{p^2}{(2\pi\hbar)^3}\,\frac{dp}{dE} = \frac{p^2}{(2\pi\hbar)^3\, dE/dp} . \tag{25}$$

To calculate the derivative, the energy conditions of the collision have to be examined. We assume the electron is very fast and therefore proceed from the relativistic energy-momentum relation

$$E = \sqrt{p_0^2c^2 + m^2c^4} + Mc^2 = \sqrt{p^2c^2 + m^2c^4} + \sqrt{\hbar^2s^2c^2 + M^2c^4} . \tag{26}$$

If the kinetic energy of the electron is large enough compared to the rest energy, the term containing mc² can be neglected and we obtain

$$\frac{E}{c} = p_0 + Mc = p + \sqrt{\hbar^2s^2 + M^2c^2} . \tag{27}$$

According to the definition of s, we get

$$\hbar^2s^2 = p_0^2 + p^2 - 2p_0p\cos\Theta . \tag{28}$$

Because of the great mass difference between nucleus and electron, the energy transfer can be considered small compared to Mc², and therefore $p \approx p_0$ and $(p - p_0)^2 \approx 0$, so that $p^2 + p_0^2 \approx 2pp_0$ results. Thus we get

$$\hbar^2s^2 \approx 2p_0p\,(1 - \cos\Theta) = 4p_0p\sin^2\frac{\Theta}{2} . \tag{29}$$

From relation (27) we find for E/c that

$$(p_0 - p + Mc)^2 = \hbar^2s^2 + M^2c^2 = 4p_0p\sin^2\frac{\Theta}{2} + M^2c^2 . \tag{30}$$

On the other hand, we have

$$(p_0 - p + Mc)^2 = (p_0 - p)^2 + 2(p_0 - p)Mc + M^2c^2 \approx 2(p_0 - p)Mc + M^2c^2 , \tag{31}$$

because $p_0 - p \approx 0$ and thus the square $(p - p_0)^2$ is vanishingly small. If we compare the two last relations, we obtain

$$(p_0 - p)Mc \approx 2p_0p\sin^2\frac{\Theta}{2} , \tag{32}$$

and with this, finally,

$$p = \frac{p_0}{1 + (2p_0/Mc)\sin^2(\Theta/2)} . \tag{33}$$

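Now the expression dE/dp has to be calculated. For this purpose we start with

$$E = pc + \sqrt{\hbar^2s^2c^2 + M^2c^4} , \tag{34}$$

from which we get

$$\frac{dE}{dp} = c + \frac{\hbar^2c^2\, s\,(ds/dp)}{\sqrt{\hbar^2s^2c^2 + M^2c^4}} \approx c + \frac{\hbar^2}{M}\, s\,\frac{ds}{dp} \tag{35}$$

and, if we make use of (29) and (33):

$$\frac{dE}{dp} \approx c\left(1 + \frac{2p_0}{Mc}\sin^2\frac{\Theta}{2}\right) = c\,\frac{p_0}{p} . \tag{36}$$

Finally, we get the following expression for the density of states (25):

$$\varrho(E) = \frac{p^2}{(2\pi\hbar)^3\, c\, p_0/p} = \frac{p^3}{(2\pi\hbar)^3\, c\, p_0} . \tag{37}$$

With the help of Fermi’s Golden Rule expressed in (12) of Example 11.7, and of (21) and (37), the transition probability per unit time is given by

$$\begin{aligned} w &= \frac{2\pi}{\hbar}\, |\langle\psi_f|V|\psi_i\rangle|^2\, \varrho(E) = \frac{2\pi}{\hbar}\left(\frac{4\pi e^2}{s^2}\right)^2 |F(s)|^2\,\frac{p^3}{(2\pi\hbar)^3\, c\, p_0} \\ &= \frac{2\pi}{\hbar}\,\frac{(4\pi e^2)^2\, |F(s)|^2}{(4p_0p/\hbar^2)^2\sin^4(\Theta/2)}\,\frac{p^3}{(2\pi\hbar)^3\, c\, p_0} \\ &= \frac{4e^4\, |F(s)|^2}{16\, p_0^2\, p^2\sin^4(\Theta/2)}\,\frac{p^3}{c\, p_0} = \left(\frac{e^2}{2p_0}\right)^2\frac{1}{c}\,\frac{p}{p_0\sin^4(\Theta/2)}\, |F(s)|^2 . \end{aligned} \tag{38}$$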


The transition probability itself cannot be measured directly, but a quantity can be observed which is called a cross-section or, more accurately, a scattering cross-section, and is denoted by σ or dσ/dΩ, respectively (see figure). It is the number of particles scattered per unit time and per unit of the incoming particle current into the section of the solid angle (Ω, Ω + dΩ). Since the states $|\mathbf k\rangle$ represent particles, the current of which is v, we obtain for the differential cross-section:

$$\frac{d\sigma}{d\Omega} = \frac{w}{v_i} . \tag{39}$$

The electron velocity $v_i$ can be set approximately to the velocity of light c; thus we write

$$\frac{d\sigma}{d\Omega} = \frac{w}{c} = \left(\frac{e^2}{2p_0c}\right)^2\frac{p}{p_0\sin^4(\Theta/2)}\, |F(s)|^2 . \tag{40}$$
[Figure: ... and the scattered wave]

[Figure: The charge distribution 4πϱ(r) of the lead nucleus, determined by elastic electron scattering. ϱ(r) is approximately constant inside the nucleus as far as r ≈ 5 fm and decreases in the surface region with a thickness of ≈ 2 fm. Other nuclei show similar behaviour: whereas the surface thickness is nearly equal for all nuclei, the radius $R_{1/2}$, at which 4πϱ(r) has its half-maximum value, changes according to $R_{1/2} = r_0A^{1/3}$, where $r_0 \approx 1.2$ fm, and A is the number of nucleons.]


This is the extended Rutherford scattering formula for the cross-section. Substituting $p/p_0$ from (33) yields finally

$$\frac{d\sigma}{d\Omega} = \left(\frac{e^2}{2p_0c}\right)^2\frac{1}{\sin^4(\Theta/2)}\, |F(s)|^2\,\frac{1}{1 + (2p_0/Mc)\sin^2(\Theta/2)} . \tag{41}$$


The effect of the recoiling nucleus is taken into account by the last factor. If the 
atomic nucleus is very heavy (M → ∞), this factor is nearly 1, corresponding to
a scattering without momentum transfer. The form factor |F(s)|? takes into 
consideration properties (extension) of the nuclear charge density. It can be 
experimentally deduced using (41) and comparing it with the experimentally 
determined cross-section (by measuring the differential cross-section). From the 
thus determined form factor, the charge distribution can be calculated according 
to (20). Robert Hofstadter, who made systematic measurements of this kind, was 
awarded the Nobel Prize for this work in 1961. Some of the most accurate 
charge distributions of atomic nuclei were measured in the same way by Peter 
Brix at the electron accelerator in Darmstadt. 
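As an illustration of how (20) is used in practice, the following sketch evaluates the form factor for an assumed model charge distribution, a homogeneously charged sphere, once by numerical integration of (20) and once from the closed form that follows from the same integral. The uniform-sphere model, the parameter values and the function names are hypothetical choices made here for illustration; they are not fitted to data and do not come from the text.

```python
import math


def form_factor_numeric(rho, s, r_max, steps=20000):
    """F(s) = (4 pi / s) * integral_0^r_max rho(R) sin(s R) R dR, cf. (20), midpoint rule."""
    d_r = r_max / steps
    total = 0.0
    for i in range(steps):
        r = (i + 0.5) * d_r
        total += rho(r) * math.sin(s * r) * r * d_r
    return 4.0 * math.pi / s * total


def form_factor_uniform_sphere(z, radius, s):
    """Closed form for a homogeneously charged sphere: 3 Z (sin x - x cos x) / x^3, x = s R0."""
    x = s * radius
    return 3.0 * z * (math.sin(x) - x * math.cos(x)) / x ** 3


# Hypothetical model parameters (illustration only, not fitted to data).
Z, R0 = 82.0, 7.0  # charge number and sphere radius in fm
rho0 = 3.0 * Z / (4.0 * math.pi * R0 ** 3)  # normalized so that the volume integral is Z, cf. (16)


def rho_uniform(r):
    return rho0 if r < R0 else 0.0


for s in (0.05, 0.3, 0.6):  # momentum transfer in 1/fm
    print(s, form_factor_numeric(rho_uniform, s, R0), form_factor_uniform_sphere(Z, R0, s))
```

For small s the form factor approaches Z, as expected from the normalization (16); at larger s it falls off and oscillates, and it is this dependence that carries the information about the extension of the charge distribution.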


By way of example, we show the charge distribution of the lead nucleus (see the preceding figure). It is nearly constant inside the nucleus and has a surface thickness of ≈ 2 fm.


EXERCISE


11.9 Limit of Small Momentum Transfer 


Problem. Show that electron scattering with small momentum transfer permits 
the determination of the total charge and mean quadratic radius of atomic 
nuclei only. 




Solution. To solve this problem, we start with the structure function (20) in 
Example 11.8: 


F(s) = =f op(R)sin(sR)R Ca (1) 


Assuming small momentum transfer s, or, more accurately, sR < 1, we can 
expand sin(sR) to obtain 


_ (sR)? 


sin(sR) x sR (2) 


Thus F(s) becomes 
F(s) = 4m { 9,(R)R2dR — =e { op(R)R4 dR = Z — = AC a) 
0 0 


The first term is just the total charge Z of the nucleus, while the second one 
contains the mean quadratic radius. To measure more details of the charge 
distribution 0,(R), the momentum transfer has to be increased [compare with 
(29) in Example 11.8]: 


(G) 
(is)? = 4p, psin7 on (4) 


This can be done by increasing the energy $E$ of the electron and simultaneously
increasing the momentum $p_0 \approx p$. Of course, we should take the best scattering
angle possible ($\theta = 180^\circ$), i.e. we must detect the backward-scattered electrons at
higher energy. Then the next term of the sine expansion becomes important, so
that one gets

$$\sin(sR) \approx sR - \frac{(sR)^3}{3!} + \frac{(sR)^5}{5!} ,$$ (5)

with the following result:

$$F(s) = Z - \frac{s^2}{3!}\langle R^2\rangle + \frac{s^4}{5!}\langle R^4\rangle .$$ (6)


We recognize that the different factors in front of the powers $s^{2n}$ of the form
factor reflect the higher moments of the charge distribution.
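
The moment expansion can be checked numerically. The following is a minimal sketch, assuming a homogeneously charged sphere of radius `a` (a model chosen only for illustration, not the measured lead distribution); the function names and the values of `Z` and `a` are hypothetical. For this model the structure function and the moments $4\pi\int\rho_p R^{n+2}\,dR$ are known in closed form, so the truncated series can be compared with the exact result.

```python
import math

def form_factor_uniform_sphere(s, a, Z):
    """Exact F(s) = (4*pi/s) * integral of rho(R) sin(sR) R dR for a uniform sphere."""
    x = s * a
    return 3.0 * Z * (math.sin(x) - x * math.cos(x)) / x**3

def form_factor_moments(s, a, Z, n_terms):
    """Truncated series Z - s^2 <R^2>/3! (+ s^4 <R^4>/5! if n_terms >= 2)."""
    r2 = 3.0 * Z * a**2 / 5.0  # 4*pi * integral of rho R^4 dR for the uniform sphere
    r4 = 3.0 * Z * a**4 / 7.0  # 4*pi * integral of rho R^6 dR for the uniform sphere
    value = Z - s**2 * r2 / 6.0
    if n_terms >= 2:
        value += s**4 * r4 / 120.0
    return value

Z, a = 82, 7.1  # illustrative charge number and radius in fm (assumed values)
for s in (0.05, 0.3):  # momentum transfer in 1/fm
    exact = form_factor_uniform_sphere(s, a, Z)
    print(s, exact, form_factor_moments(s, a, Z, 1), form_factor_moments(s, a, Z, 2))
```

For the small value of $s$ the first two terms already reproduce the exact structure function closely; for the larger value the $\langle R^4\rangle$ term visibly improves the agreement, which is the statement of the exercise.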




11.10 Properties of the Function f(t, o) 


Problem. Show that the function

$$f(t,\omega) = \frac{1}{\omega^2}\left|e^{i\omega t} - 1\right|^2$$

fulfils the following relations:

$$\int_{-\infty}^{\infty} f(t,\omega)\,d\omega = 2\pi t \quad\text{and}\quad \lim_{t\to\infty} f(t,\omega) = 2\pi t\,\delta(\omega) .$$


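Solution. According to (11.69) the function $f(t,\omega)$ is defined as

$$f(t,\omega) = \frac{2}{\omega^2}\,(1 - \cos\omega t) ;$$ (1)

therefore

$$\int_{-\infty}^{\infty} f(t,\omega)\,d\omega = 2\int_{-\infty}^{\infty}\frac{1-\cos\omega t}{\omega^2}\,d\omega .$$ (2)

By substituting $\omega t = x$, the integral becomes

$$\int_{-\infty}^{\infty} f(t,\omega)\,d\omega = 2t\int_{-\infty}^{\infty}\frac{1-\cos x}{x^2}\,dx ,$$ (3)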


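which can be integrated by parts:

$$\int_{-\infty}^{\infty}\frac{1-\cos x}{x^2}\,dx = \left[-\frac{1-\cos x}{x}\right]_{-\infty}^{\infty} + \int_{-\infty}^{\infty}\frac{\sin x}{x}\,dx = 2\int_0^{\infty}\frac{\sin x}{x}\,dx .$$ (4)

With the help of the relation

$$\frac{1}{x} = \int_0^{\infty} e^{-xu}\,du \qquad (x > 0)$$ (5)

and interchanging the order of integration, the integral (4) can be evaluated:

$$\int_0^{\infty}\frac{\sin x}{x}\,dx = \int_0^{\infty}\sin x\int_0^{\infty} e^{-xu}\,du\,dx = \int_0^{\infty}du\int_0^{\infty} e^{-xu}\sin x\,dx = \int_0^{\infty}\frac{du}{1+u^2} = \arctan u\,\Big|_0^{\infty} = \frac{\pi}{2} .$$ (6)

Thus we get

$$\int_{-\infty}^{\infty} f(t,\omega)\,d\omega = 2\pi t .$$ (7)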


To solve the second part of the problem, we use the representation of the
$\delta$-function (see Example 5.2):

$$\delta(\omega) = \frac{1}{\pi}\lim_{t\to\infty}\frac{1-\cos\omega t}{t\,\omega^2} .$$ (8)

With the help of (1), we obtain

$$\lim_{t\to\infty} f(t,\omega) = \lim_{t\to\infty}\frac{2}{\omega^2}(1-\cos\omega t) = 2\pi\lim_{t\to\infty} t\,\frac{1}{\pi}\,\frac{1-\cos\omega t}{t\,\omega^2} = 2\pi t\,\delta(\omega) .$$ (9)
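
The first relation can also be checked numerically. The sketch below is only an illustration under stated assumptions: the helper names, the finite cutoff and the number of steps are our own choices, and the infinite integration range is truncated, which leaves a small tail error of order 4/cutoff.

```python
import math

def f(t, omega):
    """f(t, omega) = (2 / omega^2) * (1 - cos(omega * t)); tends to t^2 as omega -> 0."""
    if abs(omega * t) < 1e-4:
        return t * t  # leading Taylor term avoids the 0/0 form near omega = 0
    return 2.0 * (1.0 - math.cos(omega * t)) / omega**2

def integral_of_f(t, cutoff=2000.0, steps=400_000):
    """Midpoint-rule estimate of the integral of f(t, omega) over [-cutoff, cutoff]."""
    h = 2.0 * cutoff / steps
    return h * sum(f(t, -cutoff + (k + 0.5) * h) for k in range(steps))

t = 1.0
print(integral_of_f(t), 2.0 * math.pi * t)  # the two numbers should nearly agree
```

Increasing `t` makes the central peak of $f$ higher (height $t^2$) and narrower (width of order $1/t$) while the area grows only linearly in $t$, which is the behaviour expressed by the second relation.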




11.11 Elementary Theory of the Dielectric Constant 


Let $\hat H_0$ be the Hamiltonian of an electron with charge $-e$, e.g. in a molecule
(to simplify the problem, the spectrum is assumed to be discrete). An incoming
plane monochromatic linearly polarized electromagnetic wave shall not be
influenced by the polarization of the molecule, i.e. its frequency $\omega$ shall be clearly
different from all absorption lines. It can also be proved that the contribution of
the magnetic part of the wave produces negligibly small effects.


Problem. a) Under these circumstances the wave can be described by a homogeneous
external potential, which is periodic in time and has the amplitude $F_0$
and the frequency $\omega$. Find the related Schrödinger equation, if the $z$ axis points
in the direction of the oscillation.

b) Let $\psi_0$ be the ground state of $\hat H_0$ with the energy $E_0$. Take

$$\psi(\mathbf r,t) = \psi_0(\mathbf r)\exp\left(-\frac{i}{\hbar}E_0t\right) + F_0\,\psi^{(1)}(\mathbf r,t)$$ (1)

and find the first correction $\psi^{(1)}$ to the "stationary ground-state" in the
time-periodic potential.

Hint: Set

$$\psi^{(1)}(\mathbf r,t) = w_+(\mathbf r)\exp\left[-\frac{i}{\hbar}(E_0+\hbar\omega)t\right] + w_-(\mathbf r)\exp\left[-\frac{i}{\hbar}(E_0-\hbar\omega)t\right] .$$ (2)

c) In the absence of an external field, let the molecule be in the ground state $\psi_0$,
without a dipole moment, i.e.

$$\mathbf p_0 = -e\int\psi_0^*\,\mathbf r\,\psi_0\,dV = 0 .$$ (3)

To calculate the dielectric constant, start with the definition for the dielectric
constant $\varepsilon$ via the relation

$$\varepsilon\mathbf E = \mathbf E + 4\pi\mathbf P ,$$ (4)

and insert the polarization $\mathbf P$ in a nonconducting solid given by

$$\mathbf P = P\,\mathbf e_z = N p_1\,\mathbf e_z ,$$ (5)

where $N$ is the density of the molecules (i.e. the number of molecules per cm$^3$)
and $p_1$ the part of the mean dipole moment $-e\int\psi^* z\,\psi\,dV$, linear in $F_0$, of
a single molecule oriented in the $z$ direction. Calculate the dielectric constant $\varepsilon$.


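Solution. a) For the electric field, we write

$$\mathbf E = F_0\sin\omega t\;\mathbf e_z .$$

The related potential $\phi(\mathbf r,t)$ has to satisfy

$$\mathbf E = -\operatorname{grad}\phi ,$$ (6)

so that $\phi(\mathbf r,t)$ can be written

$$\phi(\mathbf r,t) = \phi(z,t) = -zF_0\sin\omega t .$$ (7)

Thus the electron feels the potential energy

$$V(\mathbf r,t) = -e\,\phi(\mathbf r,t) = ezF_0\sin\omega t ,$$ (8)

and the Schrödinger equation for the molecular electrons can be written

$$(\hat H_0 + ezF_0\sin\omega t)\,\psi(\mathbf r,t) = i\hbar\frac{\partial\psi(\mathbf r,t)}{\partial t} .$$ (9)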


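b) With the given formulation (1) for $\psi$, the Schrödinger equation reads

$$(\hat H_0 + ezF_0\sin\omega t)\left[\psi_0(\mathbf r)\exp\left(-\frac{i}{\hbar}E_0t\right) + F_0\,\psi^{(1)}(\mathbf r,t)\right] = E_0\psi_0(\mathbf r)\exp\left(-\frac{i}{\hbar}E_0t\right) + i\hbar F_0\frac{\partial}{\partial t}\psi^{(1)}(\mathbf r,t) .$$ (10)

Comparing the terms linear in $F_0$ on both sides, we obtain

$$ezF_0\sin\omega t\;\psi_0\exp\left(-\frac{i}{\hbar}E_0t\right) + F_0\hat H_0\psi^{(1)}(\mathbf r,t) = i\hbar F_0\frac{\partial}{\partial t}\psi^{(1)}(\mathbf r,t) \;\Rightarrow\; \left(\hat H_0 - i\hbar\frac{\partial}{\partial t}\right)\psi^{(1)} = -ez\sin\omega t\;\psi_0\exp\left(-\frac{i}{\hbar}E_0t\right) .$$ (11)

If we use the hint for $\psi^{(1)}$, and note that

$$\sin\omega t = \frac{1}{2i}\left(e^{i\omega t} - e^{-i\omega t}\right) ,$$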
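we get

$$\hat H_0w_+\exp\left[-\frac{i}{\hbar}(E_0+\hbar\omega)t\right] + \hat H_0w_-\exp\left[-\frac{i}{\hbar}(E_0-\hbar\omega)t\right] - (E_0+\hbar\omega)\,w_+\exp\left[-\frac{i}{\hbar}(E_0+\hbar\omega)t\right] - (E_0-\hbar\omega)\,w_-\exp\left[-\frac{i}{\hbar}(E_0-\hbar\omega)t\right] = -\frac{ez}{2i}\,\psi_0\exp\left(-\frac{i}{\hbar}E_0t\right)\left(e^{i\omega t} - e^{-i\omega t}\right) .$$ (12)

The common factor $\exp[-(i/\hbar)E_0t]$ can be dropped; the functions $e^{i\omega t}$ and
$e^{-i\omega t}$ are linearly independent, i.e. their coefficients have to vanish. This yields

$$\hat H_0w_+ - (E_0+\hbar\omega)\,w_+ = \frac{ez}{2i}\,\psi_0 ,$$
$$\hat H_0w_- - (E_0-\hbar\omega)\,w_- = -\frac{ez}{2i}\,\psi_0 .$$ (13)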


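Now $E_0 + \hbar\omega$ is not an eigenvalue of $\hat H_0$: otherwise, $\omega$ would be an absorption
frequency; nor is $E_0 - \hbar\omega$, because $E_0$ is the lowest eigenvalue of $\hat H_0$ (the ground
state). Therefore the equations have no homogeneous solution and can be solved
without ambiguity. Let $E_j$ ($j = 0, 1, 2, \ldots$) be the eigenvalues of $\hat H_0$ (which we
have assumed to be discrete) and $\varphi_j(\mathbf r)$ the related eigenfunctions. Then $w_\pm$ can
be expanded:

$$w_\pm = \sum_{j=0}^{\infty} c_j^{(\pm)}\varphi_j(\mathbf r) .$$ (14)

Because of $\hat H_0\varphi_j = E_j\varphi_j$, we obtain

$$\sum_{j} c_j^{(\pm)}\left[E_j - (E_0 \pm \hbar\omega)\right]\varphi_j = \pm\frac{ez}{2i}\,\psi_0 .$$ (15)

Multiplication by $\varphi_j^*$ and integration yield:

$$c_j^{(\pm)}\,(E_j - E_0 \mp \hbar\omega) = \pm\frac{e}{2i}\,\langle\varphi_j|z|\psi_0\rangle ,$$ (16)

and

$$w_+(\mathbf r) = \frac{e}{2i}\sum_j\frac{\langle\varphi_j|z|\psi_0\rangle}{E_j - E_0 - \hbar\omega}\,\varphi_j(\mathbf r) ,$$ (17)

and, analogously,

$$w_-(\mathbf r) = -\frac{e}{2i}\sum_j\frac{\langle\varphi_j|z|\psi_0\rangle}{E_j - E_0 + \hbar\omega}\,\varphi_j(\mathbf r) .$$ (18)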


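Therefore we can write the wave function as

$$\psi(\mathbf r,t) = \left[\psi_0(\mathbf r) + \frac{eF_0}{2i}\sum_j\varphi_j(\mathbf r)\,\langle\varphi_j|z|\psi_0\rangle\left(\frac{e^{-i\omega t}}{E_j - E_0 - \hbar\omega} - \frac{e^{i\omega t}}{E_j - E_0 + \hbar\omega}\right)\right]\exp\left(-\frac{i}{\hbar}E_0t\right) .$$ (19)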
eee =, fos a 


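c) The mean dipole moment in the field direction is, to first order in $F_0$,

$$p = -e\int\psi^* z\,\psi\,dV = -\frac{e^2F_0}{2i}\sum_j|\langle\varphi_j|z|\psi_0\rangle|^2\left(\frac{e^{-i\omega t}}{E_j - E_0 - \hbar\omega} - \frac{e^{i\omega t}}{E_j - E_0 + \hbar\omega}\right) + \frac{e^2F_0}{2i}\sum_j|\langle\varphi_j|z|\psi_0\rangle|^2\left(\frac{e^{i\omega t}}{E_j - E_0 - \hbar\omega} - \frac{e^{-i\omega t}}{E_j - E_0 + \hbar\omega}\right) ,$$ (20)

and hence

$$p_1 = \frac{e^2F_0}{2i}\sum_j|\langle\varphi_j|z|\psi_0\rangle|^2\;2i\sin\omega t\;\frac{2(E_j - E_0)}{(E_j - E_0)^2 - (\hbar\omega)^2} .$$ (21)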


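In the absence of the field ($F_0 = 0$), only the ground state $\psi_0 = \varphi_{j=0}$ is populated,
and its dipole moment vanishes:

$$\langle\psi_0|z|\psi_0\rangle = 0 .$$

Hence, it is the electric field of the light wave which partly polarizes the atoms or
the molecules. This is quite reasonable and may be intuitively expected.
With $\hbar\omega_j = E_j - E_0$ and the so-called dipole strength

$$f_j = \frac{2m_e}{\hbar}\,|\langle\varphi_j|z|\psi_0\rangle|^2\,\omega_j$$

(where $m_e$ = electron mass), we obtain for the dielectric constant

$$\varepsilon F_0\,\mathbf e_z\sin\omega t = F_0\,\mathbf e_z\sin\omega t + \mathbf e_z\,\frac{8\pi Ne^2}{\hbar}\sum_j|\langle\varphi_j|z|\psi_0\rangle|^2\,\frac{\omega_j}{\omega_j^2 - \omega^2}\,F_0\sin\omega t = \left(1 + \frac{4\pi Ne^2}{m_e}\sum_j\frac{f_j}{\omega_j^2 - \omega^2}\right)F_0\sin\omega t\;\mathbf e_z .$$ (22)

Hence,

$$\varepsilon(\omega) = 1 + \frac{4\pi Ne^2}{m_e}\sum_j\frac{f_j}{\omega_j^2 - \omega^2} .$$ (23)

This corresponds to the classical expression of the dielectric constant.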
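
To get a feeling for the size of the effect, the dispersion formula (23) can be evaluated directly. The following is a rough sketch in Gaussian units; the function name, the single resonance line and the molecule density are assumed, illustrative values and are not taken from the exercise.

```python
import math

E_CHARGE = 4.803e-10   # elementary charge in esu (Gaussian units)
M_ELECTRON = 9.109e-28  # electron mass in g

def dielectric_constant(omega, lines, density):
    """epsilon(omega) = 1 + (4 pi N e^2 / m_e) * sum_j f_j / (omega_j^2 - omega^2).

    lines   -- list of (omega_j, f_j) pairs: resonance frequency in 1/s, dipole strength
    density -- number of molecules per cm^3
    """
    total = sum(f_j / (omega_j**2 - omega**2) for omega_j, f_j in lines)
    return 1.0 + 4.0 * math.pi * density * E_CHARGE**2 / M_ELECTRON * total

# hypothetical medium: one line at 1e16 1/s with f = 1, gas-like density
lines = [(1.0e16, 1.0)]
for omega in (1.0e15, 0.9e16, 1.1e16):
    print(omega, dielectric_constant(omega, lines, 2.7e19))
```

Below the resonance the sketch gives $\varepsilon > 1$, above it $\varepsilon < 1$; close to $\omega_j$ the formula diverges because damping of the line has been neglected, in accordance with the assumption that $\omega$ is clearly different from all absorption lines.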




Since, to a good approximation, the refraction index $n$ is related to the
dielectric constant by $n^2 = \varepsilon$, the above formula (23) represents the quantum-mechanical
calculation of the refraction index as well. Contrary to the classical
expression, the quantum-mechanical oscillator strength

$$f_j = \frac{2m_e}{\hbar^2}\,|\langle\varphi_j|z|\psi_0\rangle|^2\,(E_j - E_0)$$ (24)


can also have negative values if the atom or molecule is in an excited state in the 
beginning. This leads to the phenomenon of negative refraction (see figure). 


11.7 Biographical Notes 


STARK, Johannes, German physicist, *15.4.1874 Schickenhof, in Thansüß, district of
Amberg, †21.6.1957 Traunstein. S. became a professor in Hannover; in 1909 he went to
Aachen, in 1917, to Greifswald and in 1920, to Würzburg. He founded the "Jahrbuch der
Radioaktivität und Elektronik" in 1904 and discovered in 1905 the (optical) Doppler
effect in so-called channel rays and in 1913 the Stark effect. He was awarded the Nobel
Prize in 1919. In 1933 he became the president of the "Notgemeinschaft der Deutschen
Wissenschaft". He was a friend of P. Lenard and a supporter of "German physics"; thus
he dismissed quantum theory and the theory of relativity as "the product of Jewish
thinking".


HOFSTADTER, Robert, American physicist, *5.2.1915 New York. H. is a professor 
at Stanford University, California. He has considered problems of molecular structure 
and has contributed to the development of scintillation and crystal counters. H. proved 
that the proton and the neutron have a finite extension and structure. By examining 
electron scattering by atomic nuclei he succeeded in finding the charge distribution not 
only for the proton and neutron, but also systematically for many other nuclei of the 
Periodic System. For this research, H. was awarded the Nobel Prize in Physics in 1961, 
together with R. Mössbauer.


BRIX, Peter, German physicist, *20.10.1918 Kappeln/Schlei. B. was a professor from 
1957 to 1973 in Darmstadt, then director at the Max-Planck-Institut für Kernphysik in
Heidelberg. Together with Kopfermann, he examined the isotope shift and built the first 
German electron linear accelerator in Darmstadt with which he and his colleagues 
measured accurately the charge distribution of atomic nuclei. 


Figure (Exercise 11.11). The refractive index $n$ in the range of anomalous dispersion
in the case of (a) positive and (b) negative refraction


12. Spin 


We have often mentioned the spin of the electron in our previous considerations. 
In this chapter we want to discuss the experimental evidence for the existence of 
spin. Furthermore, we shall develop its mathematical description. 

Like the Pauli principle, spin is a phenomenon which first occurred in 
quantum mechanics and has no analogy in classical physics. The electron was 
the first elementary particle whose spin was detected. Several experiments, 
which could not be classically interpreted, motivated Goudsmit and Uhlenbeck 
in 1925 to hypothesize: 


Every electron has an intrinsic angular momentum (spin) of $\tfrac12\hbar$, which
corresponds to a magnetic moment of one Bohr magneton, $\mu_B = |e|\hbar/2mc$.


In the following, we briefly discuss three special experiments. 


In the first chapter we mentioned the Stern-Gerlach experiment (1922) as an
example of the quantization of the angular momentum¹. Figure 12.1 shows the
principle of this experiment. A beam of hydrogen atoms (in the original experiment,
silver atoms) is sent through an inhomogeneous magnetic field. The atoms
are in the ground state, which implies that the electrons are in the 1s state; thus
they have no orbital angular momentum. Therefore the atoms should not have
any magnetic moment. However, a splitting of the beam into two components
is observed, as the distribution of the intensity given in Chap. 1 shows. This
splitting has its origin in a force²

$$\mathbf F = -\nabla(-\mathbf M\cdot\mathbf B) = \nabla(\mathbf M\cdot\mathbf B) = (\mathbf M\cdot\nabla)\mathbf B ,$$ (12.1)


which acts on the magnetic moment $\mathbf M$ in the inhomogeneous magnetic field $\mathbf B$.
This splitting gives rise to the assumption that the electron has an intrinsic
magnetic moment. Because the beam is split into components of equal intensity,
it follows that all electrons have a magnetic moment with the same absolute
value. They also have two possible orientations, i.e. parallel or antiparallel to the
magnetic field.

1 This experiment was performed at the Institute of Physics at the University of
Frankfurt a.M. At that time, O. Stern was a "Privatdozent" working with Prof. E.
Madelung at the Institute of Theoretical Physics, and W. Gerlach a university lecturer at
the Institute of Experimental Physics.

2 See J.D. Jackson: Classical Electrodynamics, 2nd ed. (Wiley, New York 1975) and
W. Greiner: Theoret. Physik Band 3: Klassische Elektrodynamik, 4th ed. (Verlag H.
Deutsch, Thun und Frankfurt a.M. 1986).

Fig. 12.1. The
Fig. 12.2. Doublet splitting

In principle it is also possible that the magnetic moment originates in the 
nucleus. But we shall see later that the ratio of the magnetic moments of
the nuclei to the Bohr magneton is approximately equal to the inverse ratio
of the corresponding masses ($m_{\text{electron}}/m_{\text{proton}}$). Indeed, a more careful analysis of
the Stern-Gerlach experiment reveals a fine structure of the lines caused by 
the magnetic moments of the nuclei. 


12.1 Doublet Splitting 


A further proof of the existence of electron spin is given by the multiplet 
structure of atomic spectra. Let us take, for instance, the doublet splitting of 
sodium. Sodium has one valence electron. The transition of this electron from 
the first excited state to the ground state (2p → 1s; see Fig. 12.2) leads to two
adjacent spectral lines of 5890 Å and 5896 Å.

Although the 2p level is three-fold degenerate ($m = 0, \pm 1$), this degeneracy
can be removed by an external magnetic field. But doublet splitting can already 
be observed without an external magnetic field. This can be explained by 
assuming the existence of an electron spin in the following way. 

Owing to the magnetic field originating from the orbital motion of the 
electrons, the intrinsic magnetic moment — stemming from the spin — orients 
itself in two energetically different positions. This is analogous to the splitting of 
the beam in the Stern-Gerlach experiment. Doublet splitting follows from the 
two orientation possibilities. The splitting of the spectral lines caused by the spin 
is observed in all atoms and is called multiplet structure. 

The magnitude of the magnetic moment of the electron caused by the orbital
motion can be determined experimentally. It is a multiple of the Bohr magneton:

$$\mu_B = \frac{|e|\hbar}{2mc} .$$ (12.2)

Indeed, classically, the magnetic moment caused by the orbital motion is given
by the formula³

$$\mathbf M = \frac{1}{2c}\int\mathbf r'\times\mathbf j(\mathbf r')\,dV' = \frac{q}{2c}\,\mathbf r\times\mathbf v = \frac{q}{2mc}\,\mathbf r\times\mathbf p = \frac{q}{2mc}\,\mathbf L ,$$ (12.3)

where $q$ is the charge, $\mathbf v$ the velocity, $m$ the mass, $\mathbf j(\mathbf r') = q\,\mathbf v(\mathbf r')\,\delta(\mathbf r' - \mathbf r)$ is the
current density and $\mathbf L$ the orbital angular momentum of the particle.
As we have seen in Chap. 4, the $z$ component of the orbital angular
momentum is quantized according to

$$L_z = m_l\hbar , \qquad m_l = -l, -l+1, \ldots, l .$$ (12.4)

3 See J.D. Jackson: Classical Electrodynamics, 2nd ed. (Wiley, New York 1975) and
W. Greiner: Theoret. Physik Band 3: Klassische Elektrodynamik, 4th ed. (Verlag H.
Deutsch, Thun und Frankfurt a.M. 1986).




Hence, we expect from (12.3) that $M_z = \mu_B m_l$. For each angular momentum $l\hbar$
there are $2l + 1$ possibilities for adjusting the magnetic moment [i.e. $(2l + 1)$
values for $m_l$; see (12.4)]. In the case of spin momentum, only two such different
orientations occur. Therefore we conclude by analogy that the component of the
spin parallel to the field is only half of Planck's constant:

$$S_z = +\tfrac12\hbar \quad\text{and}\quad S_z = -\tfrac12\hbar .$$ (12.5)

Since from $(2l_s + 1) = 2$ it follows that $l_s = 1/2$, it is clear that the spin of the
electron, i.e. its intrinsic (rotational) angular momentum, is $\tfrac12\hbar$. This would
indeed explain the two orientations observed both in the Stern-Gerlach experiment
and also in doublet splitting.

Thus the spin of the electron is said to be half-integer. This property of the 
spin marks a further difference between spin and orbital angular momentum. An 
orbital angular momentum of $l\hbar$ has a maximal magnetic moment of $l\mu_B$ [see
(12.3)]. According to the measurements, the spin of an electron with a magnitude
$\tfrac12\hbar$ also implies a magnetic moment of $\mu_B$. This is most surprising because
we would have expected the spin magnetic moment to be only $\tfrac12\mu_B$.

To solve this dilemma, we introduce a new factor g. The connection between 
angular momentum and magnetic moment is generally written as 


$$\mathbf M = g\,\frac{q}{2mc}\,\mathbf J ,$$ (12.6)


where J denotes the orbital angular momentum or the spin and q is the charge of 
the particle. The quantity g is called the gyromagnetic factor or g factor. For the 
orbital angular momentum we have g = 1 [see (12.3) ]. For the spin, however, 
$g = 2$. Since the electron is negatively charged, $q = -|e|$, its magnetic moment
is always antiparallel to the angular momentum. If we combine the angular 
momentum L and the spin S into a total angular momentum J, the resulting 
total magnetic moment M is not parallel to the angular momentum, because of 
the differing gyromagnetic factors (see Fig. 12.3). The total magnetic moment 
M precesses around the total angular momentum J. Therefore, after averaging 
over time, only the component in the J direction remains. (In Example 12.3 we 
shall discuss this fact in more detail.) 

In the case of atomic nuclei, the “nuclear magneton” is commonly used as 
a unit for the magnetic moment. It differs from the Bohr magneton in that it 
replaces the electron mass $m$ in the denominator of (12.2) by the proton mass $m_p$.
Therefore the nuclear magnetic moments are roughly three orders of magnitude 
smaller than those of the electron. The interaction between the nuclear moment 
and the moment of the electron leads to the hyperfine structure of the spectra. 
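
The statement about three orders of magnitude follows from the mass ratio alone, since the two magnetons differ only in the mass in the denominator of (12.2). A small sketch, in which the constant and function names are our own and the rest energies are rounded literature values inserted for illustration:

```python
M_E_C2_MEV = 0.51099895  # electron rest energy in MeV (rounded literature value)
M_P_C2_MEV = 938.27209   # proton rest energy in MeV (rounded literature value)

def nuclear_to_bohr_magneton_ratio():
    """mu_N / mu_B = m_e / m_p, because both contain the same factor |e| hbar / 2c."""
    return M_E_C2_MEV / M_P_C2_MEV

ratio = nuclear_to_bohr_magneton_ratio()
print(ratio, 1.0 / ratio)  # about 5.4e-4, i.e. roughly 1/1836
```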


12.2 The Einstein-de Haas Experiment 


If an iron bar is magnetized, not only the elementary magnetic moments, but
also the elementary angular momenta which cause them, change and become
oriented. Because of the conservation of the angular momentum, the iron bar
as a whole must change its macroscopic angular momentum, too. From the
magnetization and the angular momentum of the bar, the gyromagnetic ratio
can be determined.

Fig. 12.3. Vector addition of the orbital ($\mathbf L$) and spin ($\mathbf S$) angular momentum and
the corresponding magnetic moments $\mathbf M_L$ and $\mathbf M_S$, respectively. The resulting
total magnetic moment $\mathbf M$ is, because of the spin g-factor $g = 2$, not collinear
with the total angular momentum $\mathbf J$. This leads to a precession of $\mathbf M$ around $\mathbf J$

Fig. 12.4. Setup of the Einstein-de Haas experiment

This basic idea led Einstein and de Haas in 1915 to an experiment that makes 
it possible to measure the gyromagnetic ratio (see Fig. 12.4). An iron bar is hung 
from a string in such a way that it can rotate about its axis. From the torsion 
vibrations of the string, the angular momentum L, which the bar gets as a whole 
when it is magnetized, can be measured. Suppose that N electrons with their 
elementary angular momentum j contribute to the magnetization. Then 


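$$N\mathbf j + \mathbf L = 0 \;\Rightarrow\; \mathbf L = -N\mathbf j$$ (12.7)

holds and correspondingly we get for the magnetization

$$\mathbf M_{\text{bar}} = N\mathbf M_{\text{electron}} = Ng\,\frac{e}{2mc}\,\mathbf j = -g\,\frac{e}{2mc}\,\mathbf L = g\,\frac{\mu_B}{\hbar}\,\mathbf L ,$$ (12.8)

where $e = -|e|$ is the charge of the electron.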


By measuring the macroscopic quantities $L$ and $M_{\text{bar}}$, we can determine the
gyromagnetic ratio $g$ of the elementary magnetic moments and the angular
momenta which cause them. $L$ is measured by the deflection of the light ray
(see Fig. 12.4); $M_{\text{bar}}$ can be determined by the residual magnetism (after previous
gauging).

As a result of the Einstein-de Haas experiment, a magnetic moment of
$-2\mu_B$ was found; thus $g = 2$. The negative sign results from the negative charge
of the electrons. The magnitude of two Bohr magnetons excludes the orbital 
angular momentum as the source of the ferromagnetism and can only be 
explained by the existence of the electron spin. 

A further conclusion to be drawn from this experiment is that the magne- 
tization of the iron bar does not originate from elementary magnetic monopoles, 
but stems from electric currents caused by angular momenta. 

We also note in passing that a precise determination of the g factor of the 
electron follows from measuring the doublet splitting and also from the Rabi 
experiment, which we shall discuss later in Example 12.2. 


12.3 The Mathematical Description of Spin 


Spin is an angular momentum; therefore its mathematical description is analogous
to the formalism of the orbital angular momentum, which we became
acquainted with earlier⁴. In this chapter we will deal with some peculiar features
resulting from the facts that spin is half-integer and that it can orient itself in
one of two ways.


4 We treat the algebra of angular momentum in great detail when discussing symmetries
in W. Greiner, B. Müller: Quantum Mechanics: Symmetries, 2nd rev. ed. (Springer, Berlin,
Heidelberg, New York 1994).




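Experiments suggest the existence of a spin vector $\hat{\mathbf S} = \{\hat S_x, \hat S_y, \hat S_z\}$ which
has three components: $\hat S_x$, $\hat S_y$, and $\hat S_z$; it should be an angular-momentum
vector operator. The characteristic feature of angular-momentum operators is
their commutation relations. Therefore we require that the $\hat S_x, \hat S_y, \hat S_z$ obey the
same commutation relations as the operators $\hat L_x, \hat L_y, \hat L_z$ of the orbital angular
momentum. This is the manifestation of the spin as an angular momentum.
Hence,

$$\hat S_x\hat S_y - \hat S_y\hat S_x = i\hbar\hat S_z , \quad \hat S_y\hat S_z - \hat S_z\hat S_y = i\hbar\hat S_x , \quad \hat S_z\hat S_x - \hat S_x\hat S_z = i\hbar\hat S_y ,$$ (12.9)

or, using the abbreviations $\{\hat S_x, \hat S_y, \hat S_z\} = \{\hat S_1, \hat S_2, \hat S_3\}$,

$$\hat S_i\hat S_j - \hat S_j\hat S_i = i\hbar\,\varepsilon_{ijk}\hat S_k .$$ (12.10)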


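Here, $\varepsilon_{ijk}$ is the completely antisymmetric tensor

$$\varepsilon_{ijk} = \begin{cases} \;\;\,1 & \text{for even permutations of } 1, 2, 3 \\ \;\;\,0 & \text{for 2 or more equal indices} \\ -1 & \text{for odd permutations of } 1, 2, 3 \end{cases}$$ (12.11)

With this $\varepsilon_{ijk}$ tensor, e.g. the cross product of two vectors $\mathbf A = \{A_i\}$ and $\mathbf B = \{B_i\}$
can be written as

$$(\mathbf A\times\mathbf B)_k = \sum_{i,j}\varepsilon_{ijk}A_iB_j .$$ (12.12)

Furthermore, the operators $\hat S_i$ should be Hermitian, i.e. $\hat S_i^\dagger = \hat S_i$, to guarantee
that their expectation values are real.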

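For the representation of the operators, it is customary to use the Pauli
matrices $\hat\sigma_i$. To exclude the factor $\tfrac12\hbar$ from the equations, we define them in the
following way:

$$\hat S_x = \frac{\hbar}{2}\hat\sigma_x , \quad \hat S_y = \frac{\hbar}{2}\hat\sigma_y , \quad \hat S_z = \frac{\hbar}{2}\hat\sigma_z .$$ (12.13)

Thus, the commutation relations (12.9) take the following form:

$$\hat\sigma_x\hat\sigma_y - \hat\sigma_y\hat\sigma_x = 2i\hat\sigma_z , \quad \hat\sigma_y\hat\sigma_z - \hat\sigma_z\hat\sigma_y = 2i\hat\sigma_x , \quad \hat\sigma_z\hat\sigma_x - \hat\sigma_x\hat\sigma_z = 2i\hat\sigma_y ,$$ (12.14)

or, in a more compact form,

$$\hat\sigma_i\hat\sigma_j - \hat\sigma_j\hat\sigma_i = 2i\,\varepsilon_{ijk}\hat\sigma_k .$$ (12.15)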


Because the spin components $\hat S_i$ have, according to their two possible orientations,
only the two eigenvalues $\pm\tfrac12\hbar$, the spin matrices must be 2 × 2 matrices
which have, as we know, exactly two eigenvalues. In the following, we take the
$z$ direction as the "direction of quantization". Then the $z$ axis is the axis which
the orientation of the spin is related to. Mathematically, this means that the spin
functions are given as the eigenfunctions of the matrix $\hat\sigma_z$.
The matrix $\hat\sigma_z$ is diagonal in its eigenrepresentation and has the eigenvalues
$\pm 1$ as diagonal elements:

$$\hat\sigma_z = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} \quad\text{and}\quad \hat\sigma_z^2 = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} = 1 .$$ (12.16)


For the matrices $\hat\sigma_x$ and $\hat\sigma_y$, the analogous relations hold in their eigenrepresentations.
Since the unit matrix remains unchanged when we change the representation,
the identity

$$\hat\sigma_x^2 = \hat\sigma_y^2 = \hat\sigma_z^2 = 1$$ (12.17)

holds generally. To obtain the matrices $\hat\sigma_x$ and $\hat\sigma_y$ in the eigenrepresentation of
$\hat\sigma_z$, we start from the commutation relations (12.14). Multiplication of the second
equation in (12.14) from the left and from the right by $\hat\sigma_y$, and addition of both
equations yields

$$\hat\sigma_y^2\hat\sigma_z - \hat\sigma_z\hat\sigma_y^2 = 2i\,(\hat\sigma_y\hat\sigma_x + \hat\sigma_x\hat\sigma_y) = 0$$ (12.18)

if we also take (12.17) into account. This means that, independently of the
representation,

$$\hat\sigma_x\hat\sigma_y + \hat\sigma_y\hat\sigma_x = 0$$ (12.19)

holds, and likewise for the other components.
The Pauli matrices are anticommuting. These relations can also be written in
the following form:

$$[\hat\sigma_x, \hat\sigma_y]_+ = [\hat\sigma_y, \hat\sigma_z]_+ = [\hat\sigma_z, \hat\sigma_x]_+ = 0 .$$ (12.20)

Relations (12.17) and (12.20) can be combined in a more compact form:

$$\hat\sigma_i\hat\sigma_j + \hat\sigma_j\hat\sigma_i = 2\delta_{ij} .$$ (12.21)


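To calculate the matrices $\hat\sigma_x$ and $\hat\sigma_y$ explicitly, we write the following:

$$\hat\sigma_x = \begin{pmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{pmatrix} , \quad \hat\sigma_y = \begin{pmatrix} b_{11} & b_{12} \\ b_{21} & b_{22} \end{pmatrix} .$$ (12.22)

From the anticommutation relations (12.20) of $\hat\sigma_x$ and $\hat\sigma_y$ with $\hat\sigma_z$, we get

$$\begin{pmatrix} a_{11} & a_{12} \\ -a_{21} & -a_{22} \end{pmatrix} = \begin{pmatrix} -a_{11} & a_{12} \\ -a_{21} & a_{22} \end{pmatrix}$$ (12.23)

and thus $a_{11} = a_{22} = 0$. Therefore

$$\hat\sigma_x = \begin{pmatrix} 0 & a_{12} \\ a_{21} & 0 \end{pmatrix} .$$ (12.24)

Since the matrices should be Hermitian, $\hat\sigma_x = \hat\sigma_x^\dagger = (\hat\sigma_x^{T})^*$ is valid and hence

$$a_{21} = a_{12}^* ,$$ (12.25)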




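so that

$$\hat\sigma_x = \begin{pmatrix} 0 & a_{12} \\ a_{12}^* & 0 \end{pmatrix} \quad\text{and}\quad \hat\sigma_x^2 = \begin{pmatrix} |a_{12}|^2 & 0 \\ 0 & |a_{12}|^2 \end{pmatrix} .$$ (12.26)

Because of (12.17) $\hat\sigma_x^2$ has to be 1; it follows that

$$|a_{12}|^2 = 1 .$$ (12.27)

The appropriate way of writing the matrix element is $e^{i\alpha}$, $\alpha$ being real.
For the matrix $\hat\sigma_y$ we can proceed in an analogous way; i.e. we write the
matrices in the form

$$\hat\sigma_x = \begin{pmatrix} 0 & e^{i\alpha} \\ e^{-i\alpha} & 0 \end{pmatrix} , \quad \hat\sigma_y = \begin{pmatrix} 0 & e^{i\beta} \\ e^{-i\beta} & 0 \end{pmatrix} .$$ (12.28)

Applying the anticommutation relation (12.20) to $\hat\sigma_x$ and $\hat\sigma_y$ leads to

$$\begin{pmatrix} e^{i(\alpha-\beta)} & 0 \\ 0 & e^{-i(\alpha-\beta)} \end{pmatrix} = -\begin{pmatrix} e^{-i(\alpha-\beta)} & 0 \\ 0 & e^{i(\alpha-\beta)} \end{pmatrix} ,$$ (12.29)

or

$$e^{i(\alpha-\beta)} = -e^{-i(\alpha-\beta)} , \quad\text{i.e.}\quad e^{2i(\alpha-\beta)} = -1 .$$ (12.30)

Thus $\alpha - \beta = \tfrac12\pi$. All relations can be satisfied if only the last one is fulfilled.
Therefore we set

$$\alpha = 0 , \quad \beta = -\frac{\pi}{2}$$ (12.31)

and get the Pauli matrices in the $\hat\sigma_z$ representation:

$$\hat\sigma_x = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix} , \quad \hat\sigma_y = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix} , \quad \hat\sigma_z = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} .$$ (12.32)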


The unit matrix together with the Pauli matrices are four linearly independent
matrices, which can be taken as a basis in the space of the two-dimensional 
matrices (compare with Exercise 13.1). They are also suitable for the description 
of other physical quantities which appear only in two states. Precisely for this 
reason, we again find the Pauli matrices in the formulation of the isotopic spin, 
which describes the states “proton” and “neutron” of a nucleon. 
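
The algebraic properties derived above can be verified by direct matrix multiplication. The sketch below uses plain nested lists for the 2 × 2 matrices of the $\hat\sigma_z$ representation; the helper names (`mul`, `combine`, `scale`, `commutator`, `anticommutator`) are our own and serve only this illustration.

```python
I2 = [[1, 0], [0, 1]]
SX = [[0, 1], [1, 0]]
SY = [[0, -1j], [1j, 0]]
SZ = [[1, 0], [0, -1]]

def mul(a, b):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)] for i in range(2)]

def combine(a, b, sign):
    """Return a + sign * b, element by element."""
    return [[a[i][j] + sign * b[i][j] for j in range(2)] for i in range(2)]

def scale(c, a):
    """Multiply every element of the matrix a by the number c."""
    return [[c * a[i][j] for j in range(2)] for i in range(2)]

def commutator(a, b):
    return combine(mul(a, b), mul(b, a), -1)

def anticommutator(a, b):
    return combine(mul(a, b), mul(b, a), +1)

# commutation relations: sigma_x sigma_y - sigma_y sigma_x = 2i sigma_z, and cyclic
print(commutator(SX, SY) == scale(2j, SZ))
print(commutator(SY, SZ) == scale(2j, SX))
print(commutator(SZ, SX) == scale(2j, SY))
# squares equal the unit matrix; different components anticommute
print(all(mul(s, s) == I2 for s in (SX, SY, SZ)))
print(anticommutator(SX, SY) == [[0, 0], [0, 0]])
```

All five checks should print `True`. Replacing $\beta = -\pi/2$ by $\beta = +\pi/2$, i.e. changing the sign of `SY`, would keep the anticommutation relations intact but spoil the sign of the commutators.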

The square of the total spin is

$$\hat{\mathbf S}^2 = \hat S_x^2 + \hat S_y^2 + \hat S_z^2 = \frac{\hbar^2}{4}\left(\hat\sigma_x^2 + \hat\sigma_y^2 + \hat\sigma_z^2\right) = \frac34\hbar^2\,1 = \frac12\left(\frac12 + 1\right)\hbar^2\,1 ,$$ (12.33)

in complete analogy to the formalism of the orbital angular momentum. From
the commutation relations (12.9), it also follows that $[\hat{\mathbf S}^2, \hat S_i]_- = 0$, i.e. each spin
component commutes with the square of the total spin⁵. Of course, this also
follows from (12.32) and (12.33), because the unit matrix commutes with every
other matrix. Since $\hat{\mathbf S}^2$ is proportional to the unit matrix, it is immediately
obvious that

$$[\hat{\mathbf S}^2, \hat S_i]_- = 0$$

holds for all $i$.

5 To this end the conclusions reached in Chap. 4 can be repeated step by step.

12.4 Wave Functions with Spin 


By taking the spin into account, we assign a further degree of freedom to 
a particle. To describe this degree of freedom, we additionally introduce the 
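component of the spin in the $z$ direction, $S_z$, as an argument of the wave
function. The component $S_z$ can take only two values, namely $\pm\tfrac12\hbar$. Therefore
the wave function has the following coordinate representation:

$$\psi = \psi(\mathbf r, S_z, t) .$$ (12.34)

Since $S_z$ takes on only two values, it is useful to denote the wave functions with
spin as column vectors with two components (spinors). This concurs with the
fact that the spin operators $\hat S_i$ are represented by 2 × 2 matrices. The two components
of the spinor are

$$\psi_1(\mathbf r, t) = \psi(\mathbf r, +\tfrac12\hbar, t) , \quad \psi_2(\mathbf r, t) = \psi(\mathbf r, -\tfrac12\hbar, t) ,$$ (12.35)

while the complete wave function is

$$\Psi = \begin{pmatrix} \psi_1(\mathbf r, t) \\ \psi_2(\mathbf r, t) \end{pmatrix} = \begin{pmatrix} \psi_1(\mathbf r, t) \\ 0 \end{pmatrix} + \begin{pmatrix} 0 \\ \psi_2(\mathbf r, t) \end{pmatrix} = \psi_1(\mathbf r, t)\begin{pmatrix} 1 \\ 0 \end{pmatrix} + \psi_2(\mathbf r, t)\begin{pmatrix} 0 \\ 1 \end{pmatrix} .$$ (12.36)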


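The introduction of product functions for both components, i.e.

$$\psi_1(\mathbf r, t)\,\chi_+ = \psi_1(\mathbf r, t)\begin{pmatrix} 1 \\ 0 \end{pmatrix} \quad\text{and}\quad \psi_2(\mathbf r, t)\,\chi_- = \psi_2(\mathbf r, t)\begin{pmatrix} 0 \\ 1 \end{pmatrix} ,$$ (12.37)

is particularly convenient. The functions $\chi_\pm$ indicate only the state of the spin,
i.e. "spin up" or "spin down". $|\psi_1|^2$ is obviously the probability density of finding an
electron with spin up at the location $\mathbf r$ and the time $t$. Correspondingly, $|\psi_2|^2$ is
the probability density of finding an electron with spin down at the location $\mathbf r$ and time $t$.
This interpretation suggests that the total probability of finding the electron
independently of its spin direction must be 1; thus

$$\int\left(|\psi_1(\mathbf r, t)|^2 + |\psi_2(\mathbf r, t)|^2\right)dV = \int(\psi_1^*, \psi_2^*)\begin{pmatrix} \psi_1 \\ \psi_2 \end{pmatrix}dV = \int\Psi^\dagger(\mathbf r, t)\,\Psi(\mathbf r, t)\,dV = 1 .$$ (12.38)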


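Spinor notation offers a clear formulation of how the spin operators, written as
Pauli matrices, act on spinors. The eigenstates of the operator $\hat\sigma_z$ are

$$\begin{pmatrix} \psi_1 \\ 0 \end{pmatrix} \quad\text{and}\quad \begin{pmatrix} 0 \\ \psi_2 \end{pmatrix} ,$$ (12.39)

because

$$\begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}\begin{pmatrix} \psi_1 \\ 0 \end{pmatrix} = (+1)\begin{pmatrix} \psi_1 \\ 0 \end{pmatrix} , \quad \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}\begin{pmatrix} 0 \\ \psi_2 \end{pmatrix} = (-1)\begin{pmatrix} 0 \\ \psi_2 \end{pmatrix} .$$ (12.40)

The spin functions $\chi_\pm$ are unit spinors:

$$\chi_+ = \begin{pmatrix} 1 \\ 0 \end{pmatrix} \quad\text{and}\quad \chi_- = \begin{pmatrix} 0 \\ 1 \end{pmatrix} .$$ (12.41)

Obviously,

$$\begin{pmatrix} \psi_1 \\ 0 \end{pmatrix} = \psi_1\chi_+ , \quad\text{or}\quad \begin{pmatrix} 0 \\ \psi_2 \end{pmatrix} = \psi_2\chi_- .$$ (12.42)

The unit spinors are, as can easily be seen, eigenfunctions of the spin operator
$\hat\sigma_z$ with the eigenvalues $+1$ and $-1$, respectively:

$$\hat\sigma_z\chi_+ = (+1)\chi_+ \quad\text{and}\quad \hat\sigma_z\chi_- = (-1)\chi_- .$$ (12.43)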
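Let us write down an arbitrary spin operator in the form

$$\hat S = \begin{pmatrix} S_{11} & S_{12} \\ S_{21} & S_{22} \end{pmatrix} .$$ (12.44)

If we use the matrix representation, an operator acts on a spin function by
matrix multiplication (see Chap. 10):

$$\Phi = \hat S\Psi , \quad\text{where}\quad \Phi = \begin{pmatrix} \varphi_1 \\ \varphi_2 \end{pmatrix} \quad\text{and}\quad \Psi = \begin{pmatrix} \psi_1 \\ \psi_2 \end{pmatrix} .$$ (12.45)

In a more detailed form, this relation reads

$$\begin{pmatrix} \varphi_1 \\ \varphi_2 \end{pmatrix} = \begin{pmatrix} S_{11} & S_{12} \\ S_{21} & S_{22} \end{pmatrix}\begin{pmatrix} \psi_1 \\ \psi_2 \end{pmatrix} = \begin{pmatrix} S_{11}\psi_1 + S_{12}\psi_2 \\ S_{21}\psi_1 + S_{22}\psi_2 \end{pmatrix}$$ (12.46)

or, for each component:

$$\varphi_1 = S_{11}\psi_1 + S_{12}\psi_2 ,$$
$$\varphi_2 = S_{21}\psi_1 + S_{22}\psi_2 .$$ (12.47)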
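The average value of an operator is defined by [compare with (10.96)]

$$\langle\hat S\rangle = \int\psi^*\hat S\,\psi\,dV .$$ (12.48)

If the wave functions are spinors, we have to use the Hermitian-conjugated wave
functions, instead of the complex-conjugated [see (10.27)]:

$$\Psi^\dagger = \begin{pmatrix} \psi_1 \\ \psi_2 \end{pmatrix}^\dagger = (\psi_1^*, \psi_2^*) .$$ (12.49)

The average value is then easily calculated as

$$\langle\hat S(t)\rangle = \int\langle\hat S(\mathbf r, t)\rangle\,dV = \int\Psi^\dagger\hat S(\mathbf r, t)\,\Psi\,dV ,$$ (12.50)

with

$$\langle\hat S(\mathbf r, t)\rangle = \Psi^\dagger\hat S\,\Psi = (\psi_1^*, \psi_2^*)\begin{pmatrix} S_{11} & S_{12} \\ S_{21} & S_{22} \end{pmatrix}\begin{pmatrix} \psi_1 \\ \psi_2 \end{pmatrix} = \psi_1^*S_{11}\psi_1 + \psi_1^*S_{12}\psi_2 + \psi_2^*S_{21}\psi_1 + \psi_2^*S_{22}\psi_2 .$$ (12.51)

Here, $\langle\hat S(\mathbf r, t)\rangle$ is the average value of the spin operator (averaged over the spin
directions) at the location $\mathbf r$ and time $t$. On the other hand, $\langle\hat S(t)\rangle$ is the average
over the spin directions and each location at time $t$.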

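Let us apply the above to calculate the average value over both possible spin
states of the Pauli matrices. For the $x$ component we have

$$\langle\hat\sigma_x(\mathbf r, t)\rangle = \Psi^\dagger\hat\sigma_x\Psi = (\psi_1^*, \psi_2^*)\begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}\begin{pmatrix} \psi_1 \\ \psi_2 \end{pmatrix} = \psi_1^*\psi_2 + \psi_2^*\psi_1 .$$ (12.52)

In an analogous way we get for the $y$ and $z$ components

$$\langle\hat\sigma_y\rangle = (\psi_1^*, \psi_2^*)\begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}\begin{pmatrix} \psi_1 \\ \psi_2 \end{pmatrix} = -i\psi_1^*\psi_2 + i\psi_2^*\psi_1 ,$$ (12.53)

$$\langle\hat\sigma_z\rangle = (\psi_1^*, \psi_2^*)\begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}\begin{pmatrix} \psi_1 \\ \psi_2 \end{pmatrix} = \psi_1^*\psi_1 - \psi_2^*\psi_2 .$$ (12.54)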

12.5 The Pauli Equation 


In Chap. 9 we developed the Hamiltonian for the motion of an electron (charge
$e$) in an electromagnetic field in the absence of spin. This Hamiltonian reads

$$\hat H_0 = \frac{1}{2m}\left(\hat{\mathbf p} - \frac{e}{c}\mathbf A\right)^2 + e\phi ,$$ (12.55)

where $\mathbf A$ is the vector potential and $\phi$ the Coulomb potential. Since the spin interacts with
the magnetic field, the electron gains additional potential energy. The magnetic
moment reads

$$\hat{\mathbf M} = g\,\frac{e}{2mc}\,\hat{\mathbf S} = -2\,\frac{|e|}{2mc}\,\frac{\hbar}{2}\,\hat{\boldsymbol\sigma} = -\mu_B\hat{\boldsymbol\sigma}$$ (12.56)

[$\mu_B = |e|\hbar/2mc$] and the potential energy in the magnetic field⁶ is:

$$U = -\hat{\mathbf M}\cdot\mathbf B .$$ (12.57)


6 See J.D. Jackson: Classical Electrodynamics, 2nd ed. (Wiley, New York 1975) and
W. Greiner: Klassische Elektrodynamik, 4th ed. (H. Deutsch, Thun 1986).




The Hamiltonian of an electron with spin takes the following form:

$$\hat H = \hat H_0 + \mu_B\hat{\boldsymbol\sigma}\cdot\mathbf B .$$ (12.58)

We can use the information about the $g$ factor of the electron ($g = 2$), which
was discussed earlier, precisely at this point. With this Hamiltonian [(12.55) and
(12.58)], we get the Schrödinger equation of a particle with spin, known as the
Pauli equation,

$$i\hbar\frac{\partial\Psi}{\partial t} = \left[\frac{1}{2m}\left(\hat{\mathbf p} - \frac{e}{c}\mathbf A\right)^2 + e\phi + \mu_B\hat{\boldsymbol\sigma}\cdot\mathbf B\right]\Psi ,$$ (12.59)

where

$$\Psi = \begin{pmatrix} \psi_1 \\ \psi_2 \end{pmatrix}$$ (12.60)

are the spinor wave functions. We call such two-component wave functions
simply spinors; sometimes they are called two-spinors to distinguish them from
the four-spinors occurring in relativistic quantum theory⁷.

Thus the Pauli equation is a system of two coupled differential equations for
$\psi_1$ and $\psi_2$, describing electrons with the $z$ component of their spin up or down,
respectively. Because of the form of the Pauli spin matrices, we can easily see
that the $\hat\sigma_z$ term leaves the system (12.59) decoupled; the two equations are coupled
only by $\hat\sigma_x$ and $\hat\sigma_y$, i.e. by the field components $B_x$ and $B_y$.
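
The coupling structure can be made explicit by writing out the 2 × 2 matrix $\mu_B\hat{\boldsymbol\sigma}\cdot\mathbf B$ for a given field. The sketch below is only an illustration: the function names are hypothetical, and $\mu_B$ is set to 1 so that energies are measured in units of $\mu_B$ times the field unit.

```python
import math

def zeeman_matrix(bx, by, bz, mu_b=1.0):
    """Matrix of mu_B * sigma . B in the sigma_z representation (12.32)."""
    return [[mu_b * bz, mu_b * complex(bx, -by)],
            [mu_b * complex(bx, by), -mu_b * bz]]

def is_decoupled(matrix, tol=1e-12):
    """True if the off-diagonal elements vanish, i.e. psi_1 and psi_2 do not mix."""
    return abs(matrix[0][1]) < tol and abs(matrix[1][0]) < tol

def zeeman_energies(bx, by, bz, mu_b=1.0):
    """Eigenvalues of the traceless Hermitian matrix above: -mu_B |B| and +mu_B |B|."""
    b = math.sqrt(bx * bx + by * by + bz * bz)
    return (-mu_b * b, mu_b * b)

print(is_decoupled(zeeman_matrix(0.0, 0.0, 2.0)))  # field along z
print(is_decoupled(zeeman_matrix(1.0, 0.0, 0.0)))  # field along x
print(zeeman_energies(3.0, 4.0, 0.0))
```

A field along the quantization axis only shifts the energies of the two components in opposite directions; any transverse component produces off-diagonal elements and therefore mixes $\psi_1$ and $\psi_2$, while the two energy eigenvalues always remain $\pm\mu_B|\mathbf B|$.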

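In the following we shall calculate the current density which results from the
spinor equation (12.59). To do this, we write it in the form

$$i\hbar\frac{\partial\Psi}{\partial t} = \hat H_0\Psi + \mu_B\hat{\boldsymbol\sigma}\cdot\mathbf B\,\Psi .$$ (12.61)

The adjoint equation of (12.61) reads

$$-i\hbar\frac{\partial\Psi^\dagger}{\partial t} = \hat H_0^*\Psi^\dagger + \mu_B\,(\hat{\boldsymbol\sigma}\cdot\mathbf B\,\Psi)^\dagger = \hat H_0^*\Psi^\dagger + \mu_B\Psi^\dagger\hat{\boldsymbol\sigma}\cdot\mathbf B ,$$ (12.62)

because $\hat{\boldsymbol\sigma}$ is Hermitian and the magnetic field $\mathbf B$ is real. Now we multiply (12.61)
from the left by $\Psi^\dagger$ and (12.62) from the right by $\Psi$. Subtraction of the equations
yields

$$i\hbar\frac{\partial}{\partial t}(\Psi^\dagger\Psi) = \Psi^\dagger(\hat H_0\Psi) - (\hat H_0^*\Psi^\dagger)\Psi .$$ (12.63)

If we insert $\hat H_0$, all parts which contain no operators drop out, i.e. we are left
with

$$i\hbar\frac{\partial}{\partial t}(\Psi^\dagger\Psi) = -\frac{\hbar^2}{2m}\left[\Psi^\dagger\nabla^2\Psi - (\nabla^2\Psi^\dagger)\Psi\right] + \frac{i\hbar e}{2mc}\left\{\Psi^\dagger(\nabla\cdot\mathbf A + \mathbf A\cdot\nabla)\Psi + \left[(\nabla\cdot\mathbf A + \mathbf A\cdot\nabla)\Psi^\dagger\right]\Psi\right\} .$$ (12.64)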


7 See Chap. 13, as well as Theoretical Physics Vol. 3, by W. Greiner: Relativistic Quantum 
Mechanics — Wave Equations (Springer, Berlin, Heidelberg, New York 1990). 
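The first term on the right-hand side can be transformed into

$$\Psi^\dagger\nabla^2\Psi - (\nabla^2\Psi^\dagger)\Psi = \operatorname{div}\left(\Psi^\dagger\nabla\Psi - (\nabla\Psi^\dagger)\Psi\right) .$$ (12.65)

For the second term (we must pay attention to the order: $\Psi^\dagger$ is the first and
$\Psi$ the second factor!),

$$\Psi^\dagger(\nabla\cdot\mathbf A + \mathbf A\cdot\nabla)\Psi + \left[(\nabla\cdot\mathbf A + \mathbf A\cdot\nabla)\Psi^\dagger\right]\Psi = 2\Psi^\dagger\Psi\operatorname{div}\mathbf A + 2\mathbf A\cdot\left(\Psi^\dagger\nabla\Psi + (\nabla\Psi^\dagger)\Psi\right) = 2\Psi^\dagger\Psi\operatorname{div}\mathbf A + 2\mathbf A\cdot\nabla(\Psi^\dagger\Psi) = 2\operatorname{div}(\mathbf A\Psi^\dagger\Psi)$$ (12.66)

is valid. Therefore from (12.63) it follows that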
alee ee A FP ihe .. " 
pe P= = divi Ph Ve (Ve —— divas 
ot 2m mc 


This is the continuity equation in the form 


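$\frac{\partial w}{\partial t} + \mathrm{div}\,\boldsymbol{j} = 0$ , where (12.68)

$w = \Psi^\dagger\Psi$ (12.69)


is the probability density and

$\boldsymbol{j} = -\frac{i\hbar}{2m}\left[\Psi^\dagger\nabla\Psi - (\nabla\Psi^\dagger)\Psi\right] - \frac{e}{mc}\boldsymbol{A}\Psi^\dagger\Psi$ (12.70)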


is the current density of the electrons. 
Now we insert the two-component wave function 


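$\Psi = \begin{pmatrix}\psi_1\\ \psi_2\end{pmatrix}$ and $\Psi^\dagger = (\psi_1^{*}, \psi_2^{*})$ (12.71)

and arrive at

$w = \psi_1^{*}\psi_1 + \psi_2^{*}\psi_2$ (12.72)

and

$\boldsymbol{j} = \frac{i\hbar}{2m}\left(\psi_1\nabla\psi_1^{*} + \psi_2\nabla\psi_2^{*} - \psi_1^{*}\nabla\psi_1 - \psi_2^{*}\nabla\psi_2\right) - \frac{e}{mc}\boldsymbol{A}\left(\psi_1^{*}\psi_1 + \psi_2^{*}\psi_2\right)$ (12.73)


or, rearranging,

$\boldsymbol{j} = \frac{i\hbar}{2m}\left(\psi_1\nabla\psi_1^{*} - \psi_1^{*}\nabla\psi_1\right) - \frac{e}{mc}\boldsymbol{A}\psi_1^{*}\psi_1 + \frac{i\hbar}{2m}\left(\psi_2\nabla\psi_2^{*} - \psi_2^{*}\nabla\psi_2\right) - \frac{e}{mc}\boldsymbol{A}\psi_2^{*}\psi_2$ . (12.74)


It turns out that both the probability density and the current density are
composed additively of the parts of the two different spin directions; this is
reasonable. Multiplication of the particle current density $\boldsymbol{j}$ by the charge e yields
the electrical current density $\boldsymbol{j}_e$.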




The current density $\boldsymbol{j}_e$ does not contain the spin; rather it is the current
density caused by the orbital motion of the electrons (with a different spin).
However, the spin of an electron also causes a magnetic moment, which can be
expressed by a corresponding current. We shall call this part of the current
density $\boldsymbol{j}_s$ the spin current density. This current density cannot occur in a
continuity equation in which the charge conservation is expressed by convection
currents.

To calculate the spin current density $\boldsymbol{j}_s$, we start with Maxwell's equations.
For the curl of the B field the following well-known relation⁸ exists:


$\mathrm{curl}\,\boldsymbol{B} = \frac{4\pi}{c}\left(\boldsymbol{j}_e + c\,\mathrm{curl}\langle\boldsymbol{M}\rangle\right)$ . (12.75)


We have replaced the magnetization $\boldsymbol{M}$ in this case by the averaged density of
the magnetic moment $\langle\boldsymbol{M}\rangle$, where averaging over the spin states is meant. The
magnetic dipole density is given by


$\langle\boldsymbol{M}\rangle = -\mu_B\Psi^\dagger\hat{\boldsymbol{\sigma}}\Psi$ (12.76)

and thus

$\mathrm{curl}\,\boldsymbol{B} = \frac{4\pi}{c}\boldsymbol{j} = \frac{4\pi}{c}\left(\boldsymbol{j}_e - c\mu_B\,\mathrm{curl}\,\Psi^\dagger\hat{\boldsymbol{\sigma}}\Psi\right) = \frac{4\pi}{c}\left(\boldsymbol{j}_e + \boldsymbol{j}_s\right)$ . (12.77)


The contribution

$\boldsymbol{j}_s = -c\mu_B\,\mathrm{curl}\,\Psi^\dagger\hat{\boldsymbol{\sigma}}\Psi$ (12.78)


is the current causing the magnetic moments of the electrons, and is equivalent
to them.


EXERCISE


12.1 Spin Precession in a Homogeneous Magnetic Field 


Problem. Determine the precession of the spin in a homogeneous magnetic field
(see figure).


Solution. If a charged body moves in a homogeneous magnetic field, it circulates
around the direction of the field with the frequency $\omega = 2\omega_L = -eB/mc$. Here
the charge of the electron is (−e). This follows from the fact that the Lorentz
force balances the centrifugal force:


$m r\omega^2 = -\frac{e}{c}\,r\omega B$ . (1)


⁸ See J.D. Jackson: Classical Electrodynamics, 2nd ed. (Wiley, New York 1975) and
W. Greiner, Theoret. Physik Band 3: Klassische Elektrodynamik, 4th ed. (Verlag H.
Deutsch, Thun und Frankfurt a.M. 1986).


[Figure: Particle with spin and magnetic moment M precessing in a magnetic field B]


Thus

$\omega = -\frac{eB}{mc} = 2\omega_L$ , (2)

whereby $\omega_L = -eB/2mc$ is the so-called Larmor frequency.


The spin function at time t = 0 reads

$\chi = a_0\chi_+ + b_0\chi_-$ . (3)


If we take for the constants $a_0 = e^{i\gamma}\cos(\Theta/2)$ and $b_0 = e^{i\delta}\sin(\Theta/2)$, the
normalization condition


$|a_0|^2 + |b_0|^2 = 1$ (4)


is obviously fulfilled. Now we shall calculate the frequency of precession of the
spin in the magnetic field $\boldsymbol{B} = \{0, 0, B\}$.

Let us assume that the electron is fixed at a certain location and its spin is the 
only degree of freedom. That part of the Pauli equation (12.59) which contains 
the spin yields 


$i\hbar\,\frac{\partial\chi}{\partial t} = \hbar\omega_L\,\hat\sigma_z\,\chi$ . (5)


The spin function written as a column vector reads 


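$\chi = a\chi_+ + b\chi_- = a\begin{pmatrix}1\\0\end{pmatrix} + b\begin{pmatrix}0\\1\end{pmatrix} = \begin{pmatrix}a\\b\end{pmatrix}$ . (6)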


Inserting this into (5) results in 


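$i\hbar\begin{pmatrix}\dot a\\ \dot b\end{pmatrix} = \hbar\omega_L\begin{pmatrix}1&0\\0&-1\end{pmatrix}\begin{pmatrix}a\\b\end{pmatrix} = \hbar\omega_L\begin{pmatrix}a\\-b\end{pmatrix}$ , (7)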


and thus 

$\dot a = -i\omega_L a$ , $\dot b = i\omega_L b$ . (8)

After integration we obtain

$a = a_0 e^{-i\omega_L t}$ , $b = b_0 e^{i\omega_L t}$ . (9)


The time-dependent spin function thus reads 


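$\chi(t) = \begin{pmatrix} e^{-i\omega_L t}\,e^{i\gamma}\cos(\Theta/2)\\ e^{i\omega_L t}\,e^{i\delta}\sin(\Theta/2)\end{pmatrix}$ . (10)

The expectation value of the spin is obtained from

$\langle\hat{\boldsymbol{S}}\rangle = \frac{\hbar}{2}\langle\hat{\boldsymbol{\sigma}}\rangle = \frac{\hbar}{2}\chi^\dagger\hat{\boldsymbol{\sigma}}\chi = \frac{\hbar}{2}\left\{\chi^\dagger\hat\sigma_x\chi,\ \chi^\dagger\hat\sigma_y\chi,\ \chi^\dagger\hat\sigma_z\chi\right\}$ . (11)

Inserting the Pauli spin matrices from (12.32) and $\chi$ of (10) into (11) leads to

$\langle\hat{\boldsymbol{S}}\rangle = \frac{\hbar}{2}\left[\cos(2\omega_L t + \delta - \gamma)\sin\Theta,\ \sin(2\omega_L t + \delta - \gamma)\sin\Theta,\ \cos\Theta\right]$ . (12)


Obviously, the spin component in the field direction $S_z$ is conserved while the
spin precesses around the z axis with twice the Larmor frequency, $2\omega_L$. This is
due to the gyromagnetic factor 2 of the spin. In contrast to this, the average
value of the orbital angular momentum precesses with only the frequency
$\omega_L$ around the z axis [see Example 12.3, (15) ff.].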
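
The result (12) can be checked numerically. The following is a minimal sketch, assuming the standard representation of the Pauli matrices and units with ħ = 1; the function names and the sample values of $\omega_L$, $\Theta$, $\gamma$ and $\delta$ are chosen here purely for illustration. It builds the spinor (10), evaluates $\langle\hat{\boldsymbol{S}}\rangle$ according to (11), and compares the outcome with the closed form (12).

```python
import numpy as np

# Pauli matrices in the standard representation (assumed to agree with (12.32)).
SIGMA = np.array([
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
], dtype=complex)


def spinor(t, omega_L, theta, gamma, delta):
    """Spinor (10): a = a0 exp(-i w_L t), b = b0 exp(+i w_L t)."""
    a = np.exp(-1j * omega_L * t) * np.exp(1j * gamma) * np.cos(theta / 2)
    b = np.exp(1j * omega_L * t) * np.exp(1j * delta) * np.sin(theta / 2)
    return np.array([a, b])


def spin_expectation(chi, hbar=1.0):
    """<S> = (hbar/2) chi^dagger sigma chi, component by component, cf. (11)."""
    return 0.5 * hbar * np.real(np.array([chi.conj() @ s @ chi for s in SIGMA]))


def spin_closed_form(t, omega_L, theta, gamma, delta, hbar=1.0):
    """Closed form (12) for the three components of <S>."""
    phase = 2 * omega_L * t + delta - gamma
    return 0.5 * hbar * np.array([
        np.cos(phase) * np.sin(theta),
        np.sin(phase) * np.sin(theta),
        np.cos(theta),
    ])


# Hypothetical sample parameters (arbitrary units).
params = dict(omega_L=1.3, theta=0.7, gamma=0.2, delta=-0.4)
times = np.linspace(0.0, 5.0, 11)
max_deviation = max(
    np.max(np.abs(spin_expectation(spinor(t, **params))
                  - spin_closed_form(t, **params)))
    for t in times
)
print("largest deviation between (11) and (12):", max_deviation)
```

The deviation should vanish up to rounding errors, and the z component stays constant in time while the x and y components rotate with $2\omega_L$.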


EXAMPLE 


12.2 The Rabi Experiment (Spin Resonance) 


To measure the nuclear magnetic moment, Rabi developed the method of spin 
resonance. The scheme of this experiment is sketched in the figure below. 




If the particles reach the inhomogeneous magnetic field 1, they will be 
deflected, depending on their spin orientation, in such a way that the slit A can 
be passed only by particles with a certain spin direction. The homogeneous field 
2 has no influence on the deflection of the particles. Thereafter, the particles 
enter field 3, which has a field gradient opposite to that of field 1. Field 3 cancels 
the original deflection, so that the particles reach the detector. 

If an oscillating field that leads to a spin flip is superposed on the homogeneous
field 2, then the particles will be deflected in field 3 in the wrong direction
and will not reach the detector. From the frequency of the oscillating field
(the resonance frequency) that leads to a minimum in the beam intensity at
the detector, the magnetic moment of the particle can be calculated.

[Figure: Setup of the Rabi experiment. The magnetic fields of magnets 1 and 3 have strong gradients in opposite directions, while the field of magnet 2 is homogeneous]

Now we want to investigate mathematically the behaviour of a particle with
spin $\frac{1}{2}\hbar$ in a homogeneous magnetic field $B_z$, on which a weak oscillating
field is superposed. The magnetic field should have the following form:


$\boldsymbol{B} = (B_0\cos\omega_0 t,\ B_0\sin\omega_0 t,\ B_z)$ . (1)

We write the following for the spin function:


$\chi(t) = a(t)\,e^{-i\omega t}\chi_+ + b(t)\,e^{i\omega t}\chi_-$ , (2)


where $\omega = -\mu B_z/\hbar$ is the Larmor frequency for the precession of the spins
around the homogeneous field. The factor $\mu$ denotes the connection between the
magnetic moment and the spin of the particle:


$\hat{\boldsymbol{M}} = \mu\hat{\boldsymbol{\sigma}}$ . (3)

We start from the spin-dependent part of the Pauli equation:


$i\hbar\,\frac{\partial\chi}{\partial t} = -\boldsymbol{B}\cdot\hat{\boldsymbol{M}}\chi = -\mu\boldsymbol{B}\cdot\hat{\boldsymbol{\sigma}}\chi$ . (4)


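Inserting (1) and (2) into (4) yields


$i\hbar\,\frac{\partial}{\partial t}\begin{pmatrix}a(t)e^{-i\omega t}\\ b(t)e^{i\omega t}\end{pmatrix} = -\mu\left(B_0\cos\omega_0 t\,\hat\sigma_x + B_0\sin\omega_0 t\,\hat\sigma_y + B_z\hat\sigma_z\right)\begin{pmatrix}a(t)e^{-i\omega t}\\ b(t)e^{i\omega t}\end{pmatrix}$ . (5)


Now explicitly inserting the Pauli matrices and calculating the derivatives yields


$i\hbar\begin{pmatrix}\dot a\,e^{-i\omega t}\\ \dot b\,e^{i\omega t}\end{pmatrix} + \hbar\omega\begin{pmatrix}a\,e^{-i\omega t}\\ -b\,e^{i\omega t}\end{pmatrix} = -\mu B_0\cos\omega_0 t\begin{pmatrix}b\,e^{i\omega t}\\ a\,e^{-i\omega t}\end{pmatrix} - i\mu B_0\sin\omega_0 t\begin{pmatrix}-b\,e^{i\omega t}\\ a\,e^{-i\omega t}\end{pmatrix} - \mu B_z\begin{pmatrix}a\,e^{-i\omega t}\\ -b\,e^{i\omega t}\end{pmatrix}$ . (6)

The last terms on both sides cancel each other because $\hbar\omega = -\mu B_z$. Together
with $\hbar\omega' = -\mu B_0$, we can write for both components of the spinor $\chi$

$\dot a = -i\omega'\,b\,e^{i(2\omega - \omega_0)t}$ , (7)

$\dot b = -i\omega'\,a\,e^{-i(2\omega - \omega_0)t}$ . (8)


These equations can be decoupled by taking the derivative of the first one and
eliminating $b$ and $\dot b$:


$\ddot a - i(2\omega - \omega_0)\dot a + \omega'^2 a = 0$ . (9)


To solve this homogeneous differential equation, we try writing $a \propto e^{i\tilde\omega t}$ and
thus get the characteristic equation for $\tilde\omega$ having the solutions


$\tilde\omega_{1,2} = \omega - \frac{\omega_0}{2} \pm \sqrt{(\omega - \omega_0/2)^2 + \omega'^2}$ (10)

or


$\tilde\omega_{1,2} = \Omega \pm \delta$ , $\Omega = \omega - \frac{\omega_0}{2}$ , $\delta^2 = (\omega - \omega_0/2)^2 + \omega'^2$ . (11)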
The general solution for the coefficient a is therefore given by

$a(t) = a_1 e^{i(\Omega + \delta)t} + a_2 e^{i(\Omega - \delta)t}$ . (12)

We choose the initial conditions in such a way that the particle at t = 0 is in the
spin state $\chi_+$, i.e. $|a(t = 0)|^2 = 1$ and $b(t = 0) = 0$. Thus from (12) it follows that

$a_1 + a_2 = 1$ . (13)
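Together with (12) and (7) we get for the coefficient b

$b(t) = \frac{i}{\omega'}\,\dot a\,e^{-2i\Omega t} = -\frac{1}{\omega'}\,e^{-i\Omega t}\left[a_1(\Omega + \delta)e^{i\delta t} + a_2(\Omega - \delta)e^{-i\delta t}\right]$ . (14)


Starting from the initial conditions, we can now calculate the coefficients $a_1$ and
$a_2$. From $b(t = 0) = 0$ it follows that


$a_1(\Omega + \delta) + a_2(\Omega - \delta) = 0$ . (15)

Together with (13) we get


$a_1 = \frac{1}{2}\left(1 - \frac{\Omega}{\delta}\right)$ , $a_2 = \frac{1}{2}\left(1 + \frac{\Omega}{\delta}\right)$ . (16)


Both amplitudes a(t) and b(t) are given by


$a(t) = \left(\cos\delta t - i\,\frac{\Omega}{\delta}\sin\delta t\right)e^{i\Omega t}$ ,
$b(t) = -i\,\frac{\omega'}{\delta}\sin\delta t\;e^{-i\Omega t}$ . (17)


From the terms in (2), returning to the spin, we realize that the quantity $|b(t)|^2$ is
the probability of finding the particle in the state $\chi_-$ at time t:


$|b(t)|^2 = \frac{\omega'^2}{\delta^2}\sin^2\delta t$ . (18)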


Let $t_0$ be the time the particle needs to pass through the oscillating field.
The experimental parameters are to be adjusted in such a way that after this time
the largest possible number of particles are in the state $\chi_-$, i.e. the
spin-flip probability $|b(t)|^2$ reaches its maximum at $t = t_0$. From
$d|b|^2/dt = 0$ it follows that


$\sin\delta t\,\cos\delta t = 0$ . (19)


The $\sin^2\delta t$ curve of (18) has its maxima at the same locations at which the sine
function has extrema. Hence, the maximum which we are looking for is given
by the zero of the cosine function.

Therefore, at time $t_0$, it holds that


$\delta t_0 = \frac{\pi}{2}$ or $t_0 = \frac{\pi}{2\sqrt{(\omega - \omega_0/2)^2 + \omega'^2}}$ . (20)


The time $t_0$ is fixed by the velocity of the particle and the size of the area which
the oscillating field occupies. Equation (20) contains, in addition, the data on
the magnetic field and the unknown magnetic moment $\mu$, which can thus be
determined.
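
The closed form (18) can be compared with a direct numerical integration of the coupled equations (7) and (8). The sketch below is only an illustration under stated assumptions: it uses a hand-written fourth-order Runge-Kutta step, and the values chosen for $\omega$, $\omega_0$ and $\omega'$ are hypothetical sample numbers in arbitrary units, not data of an actual experiment.

```python
import numpy as np


def rhs(t, y, omega, omega0, omega_p):
    """Right-hand sides of (7) and (8) for y = (a, b)."""
    a, b = y
    phase = np.exp(1j * (2 * omega - omega0) * t)
    return np.array([-1j * omega_p * b * phase,
                     -1j * omega_p * a * np.conj(phase)])


def integrate(omega, omega0, omega_p, t_end, steps=4000):
    """Fourth-order Runge-Kutta, starting in the state chi_+ (a = 1, b = 0)."""
    y = np.array([1.0 + 0j, 0.0 + 0j])
    h = t_end / steps
    t = 0.0
    for _ in range(steps):
        k1 = rhs(t, y, omega, omega0, omega_p)
        k2 = rhs(t + h / 2, y + h / 2 * k1, omega, omega0, omega_p)
        k3 = rhs(t + h / 2, y + h / 2 * k2, omega, omega0, omega_p)
        k4 = rhs(t + h, y + h * k3, omega, omega0, omega_p)
        y = y + h / 6 * (k1 + 2 * k2 + 2 * k3 + k4)
        t += h
    return y


def flip_probability(t, omega, omega0, omega_p):
    """Closed form (18): |b|^2 = (omega'^2 / delta^2) sin^2(delta t)."""
    delta = np.sqrt((omega - omega0 / 2) ** 2 + omega_p ** 2)
    return (omega_p / delta) ** 2 * np.sin(delta * t) ** 2


# Hypothetical sample values (arbitrary units).
omega, omega0, omega_p = 1.0, 1.6, 0.3
delta = np.sqrt((omega - omega0 / 2) ** 2 + omega_p ** 2)
t0 = np.pi / (2 * delta)          # time of maximal spin flip, cf. (20)

a_num, b_num = integrate(omega, omega0, omega_p, t0)
p_numeric = abs(b_num) ** 2
p_formula = flip_probability(t0, omega, omega0, omega_p)
print("numerical |b|^2:", p_numeric, " formula (18):", p_formula)
```

At resonance, $\omega_0 = 2\omega$, we have $\delta = |\omega'|$ and the flip probability at $t_0$ reaches 1; away from resonance the maximum is reduced to $\omega'^2/\delta^2$.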
EXAMPLE


12.3 The Simple Zeeman Effect (Weak Magnetic Fields) 


As a further example of the application of the Pauli equation, we consider the 
splitting of the spectral lines in a weak magnetic field. Here we shall treat the 
simple Zeeman effect, i.e. we neglect the spin-orbit interaction. 

The spin-orbit interaction leads to the fine structure of the spectra, a further 
splitting, which we shall not take into account here.” 

The magnetic field should be homogeneous and possess only a z component: 


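$\boldsymbol{B} = \{0, 0, B\}$ . (1)

In this case, we may express it by a vector potential

$\boldsymbol{A} = \{-\tfrac{1}{2}By,\ \tfrac{1}{2}Bx,\ 0\}$ , (2)

as can easily be shown by the relation

$\boldsymbol{B} = \mathrm{curl}\,\boldsymbol{A}$ . (3)


We denote the Coulomb potential by $\phi$.
Again we start from the Pauli equation for a particle with the charge e. The
Hamiltonian reads


$\hat H = \frac{1}{2m}\left(\hat{\boldsymbol{p}} - \frac{e}{c}\boldsymbol{A}\right)^2 + e\phi - \frac{e\hbar}{2mc}\,\hat{\boldsymbol{\sigma}}\cdot\boldsymbol{B}$ . (4)


Since the magnetic field is weak, we neglect the term with $A^2$ and get, using
div A = 0,


$i\hbar\,\frac{\partial\Psi}{\partial t} = \left[\frac{\hat{\boldsymbol{p}}^2}{2m} - \frac{e}{mc}\boldsymbol{A}\cdot\hat{\boldsymbol{p}} + e\phi - \frac{e\hbar}{2mc}\,\hat{\boldsymbol{\sigma}}\cdot\boldsymbol{B}\right]\Psi$ . (5)


Instead of the term $\boldsymbol{A}\cdot\hat{\boldsymbol{p}}$, we introduce the angular-momentum operator.
According to (2) we get


$\boldsymbol{A}\cdot\hat{\boldsymbol{p}} = \frac{B}{2}\left(x\hat p_y - y\hat p_x\right) = \frac{B}{2}\,\frac{\hbar}{i}\left(x\frac{\partial}{\partial y} - y\frac{\partial}{\partial x}\right) = \frac{B}{2}\hat L_z$ .

Together with $\hat H_0 = \hat{\boldsymbol{p}}^2/2m + e\phi$, (5) leads to


$i\hbar\,\frac{\partial\Psi}{\partial t} = \left[\hat H_0 - \frac{eB}{2mc}\left(\hat L_z + \hbar\hat\sigma_z\right)\right]\Psi$ . (6)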
Since we are interested only in the energies of the stationary states, we write the
following for the wave function:


$\Psi(\boldsymbol{r}, t) = \Psi(\boldsymbol{r})\exp\left(-\frac{i}{\hbar}Et\right)$ . (7)

Thus (6) can be transformed into the eigenvalue equation

$\left[\hat H_0 - \frac{eB}{2mc}\left(\hat L_z + \hbar\hat\sigma_z\right)\right]\Psi = E\Psi$ .

Taking the Larmor frequency $\omega_L = -eB/2mc$, and applying the spinor notation,
we get


$\hat H_0\begin{pmatrix}\psi_1\\ \psi_2\end{pmatrix} + \omega_L\left[\hat L_z + \hbar\begin{pmatrix}1&0\\0&-1\end{pmatrix}\right]\begin{pmatrix}\psi_1\\ \psi_2\end{pmatrix} = E\begin{pmatrix}\psi_1\\ \psi_2\end{pmatrix}$ . (8)


⁹ We shall discuss this topic in Vol. 3 of this series, W. Greiner: Relativistic Quantum
Mechanics (Springer, Berlin, Heidelberg, New York 1989), where it follows naturally from
the Dirac equation.


Both spinor components are decoupled (since $\hat\sigma_z$ is diagonal) and yield the
equations


$\hat H_0\psi_1 + \omega_L(\hat L_z + \hbar)\psi_1 = E\psi_1$ ,
$\hat H_0\psi_2 + \omega_L(\hat L_z - \hbar)\psi_2 = E\psi_2$ . (9)


If the magnetic field were absent, we would get the eigenstates of $\hat H_0$ as
solutions (identical solutions, in fact) for both spinor components, as a look at (9)
tells us:


$\psi_1 = \psi_2 = \psi_{nlm} = R_{nl}(r)\,Y_{lm}(\vartheta, \varphi)$ . (10)

Since the wave function $\psi_{nlm}$ is an eigenfunction of $\hat L_z$,

$\hat L_z\psi_{nlm} = m\hbar\,\psi_{nlm}$ , (11)


$\psi_{nlm}$ is also an eigenfunction of the complete equations (9). Thus the wave
functions are not altered in the approximation ($A^2 = 0$) we have used.
Together with the eigenvalue equation of the operator $\hat H_0$


$\hat H_0\psi_{nlm} = E^0_{nl}\psi_{nlm}$


and the relations (10) and (11), we get from (9) the energy eigenvalues:


$E'_{nlm} = E^0_{nl} + \omega_L\hbar(m + 1)$ for $\Psi = \begin{pmatrix}\psi_{nlm}\\ 0\end{pmatrix}$

and

$E''_{nlm} = E^0_{nl} + \omega_L\hbar(m - 1)$ for $\Psi = \begin{pmatrix}0\\ \psi_{nlm}\end{pmatrix}$ . (12)


Because of the magnetic field, the energy depends on the orientation of the 
magnetic moment with respect to the field direction. Levels which are degen- 
erate when the magnetic field is missing then split up. The two-fold splitting of 
the s states, which have no orbital magnetic moment, is proof of the existence of 
spin (the Stern-Gerlach experiment). 

The following figure shows the splitting of a $\psi_{100}$ and a $\psi_{21m}$ state.

[Figure: Splitting of the 1s and 2p levels in a magnetic field (the Zeeman effect); the levels are labelled by spin up or spin down]

[Figure: The classical understanding of the Zeeman effect]

The 2p state thus splits into five levels; one of them is twofold degenerate. 
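
The counting of levels can be made explicit with a few lines of code. This is a small sketch, assuming the shifts $\omega_L\hbar(m + 1)$ for spin up and $\omega_L\hbar(m - 1)$ for spin down that follow from the decoupled equations (9); the function name `zeeman_shifts` is introduced here only for illustration.

```python
from collections import Counter


def zeeman_shifts(l):
    """Shifts in units of hbar*omega_L: m + 1 (spin up) and m - 1 (spin down).

    Returns a dict {shift: degeneracy} for orbital quantum number l.
    """
    shifts = Counter()
    for m in range(-l, l + 1):
        shifts[m + 1] += 1   # spin up, cf. first equation of (9)
        shifts[m - 1] += 1   # spin down, cf. second equation of (9)
    return dict(sorted(shifts.items()))


levels_1s = zeeman_shifts(0)
levels_2p = zeeman_shifts(1)
print("1s:", levels_1s)
print("2p:", levels_2p)
```

For l = 0 two shifted levels result, and for l = 1 five, of which the unshifted one occurs twice (m = −1 with spin up and m = +1 with spin down).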

Since the interaction of the spin with the light wave emitted when 
a transition occurs is small, the spin is not altered. Therefore only transitions 
between states of equal spin direction occur; these transitions are indicated in the 
figure. (Commonly, the dipole selection rule $\Delta m = \pm 1, 0$ holds.)

We get the transition frequencies by using the differences of the energies (12).
Since the spin direction does not change, we have


$\hbar\omega = E'_{nlm'} - E'_{n'l'm''} = E^0_{nl} - E^0_{n'l'} + \omega_L\hbar(m' - m'')$ ,
$\omega = \omega_0 + \omega_L(m' - m'')$ ,


where $\omega_0$ is the transition frequency if the magnetic field is missing. Since the
difference is $m' - m'' = \pm 1, 0$, we get two spectral lines in the magnetic field,
shifted from the original one by $\pm\omega_L$.

This result coincides precisely with the classical theory of the Zeeman effect
(see next figure). Here, circular motion of an electron in a magnetic field is
investigated. The centrifugal force $mr\omega^2$ and the Lorentz force $\pm er\omega B/c$ act on
the electron, depending on the direction of motion.

Thus


$mr\omega^2 \pm er\omega B/c = mr(\omega + \Delta\omega)^2$

holds. If we neglect the second-order term $\propto (\Delta\omega)^2$ we get

$\Delta\omega = \pm eB/2mc = \pm\omega_L$ .


In the figure, the decomposition of the circular motion is illustrated. The motion
leading to a shift in the frequency proceeds in the plane perpendicular to the
field. $\Delta\omega = 0$ corresponds to a motion parallel to the field.

We noted earlier that an angular momentum of a classical charged particle 
in a magnetic field precesses with the Larmor frequency around the direction of 
the magnetic field (see Exercise 12.1). It is possible also to identify a precession 
by treating the Zeeman effect quantum mechanically. 

The Hamiltonian in (6) can be written in the form 


$\hat H = \hat H_0 + \omega_L\hat L_z + 2\omega_L\hat S_z$ . (13)


The rate of change of the angular momenta results from the commutation
relations

$\frac{d\hat L_x}{dt} = \frac{i}{\hbar}\left[\hat H, \hat L_x\right]$


and analogously for the other components as well as for the spin. The
components of the orbital angular momentum commute with $\hat H_0$ and $\hat S_z$; therefore,
only the commutation relation containing $\hat L_z$ remains. Together with the
commutation relations of (4.65), it follows that


$\frac{d\hat L_x}{dt} = -\omega_L\hat L_y$ , $\frac{d\hat L_y}{dt} = \omega_L\hat L_x$ , $\frac{d\hat L_z}{dt} = 0$ . (14)


The second derivatives with respect to time follow immediately from these
equations:


$\frac{d^2\hat L_x}{dt^2} = -\omega_L^2\hat L_x$ , $\frac{d^2\hat L_y}{dt^2} = -\omega_L^2\hat L_y$ . (15)


We know that the expectation values fulfil the same relations as the operators.
As can easily be calculated, (14) and (15) have the solutions


$\langle L_x\rangle = A\sin(\omega_L t + \varphi)$ ,
$\langle L_y\rangle = -A\cos(\omega_L t + \varphi)$ ,
$\langle L_z\rangle = \mathrm{const}$ . (16)


The same commutation relations also hold for the spin. The spin operator
commutes with $\hat H_0$ and $\hat L_z$, but its components $\hat S_x$ and $\hat S_y$ do not commute
with the term containing $\hat S_z$ in (13). We get relations equivalent to (14) and (15),
whereby, corresponding to (13), $\omega_L$ is replaced by $2\omega_L$ (cf. Exercise 12.1):


$\langle S_x\rangle = A\sin(2\omega_L t + \varphi)$ ,
$\langle S_y\rangle = -A\cos(2\omega_L t + \varphi)$ ,
$\langle S_z\rangle = \mathrm{const}$ . (17)


As indicated in the figure above, these equations imply that the components of
orbital and spin angular momentum parallel to the magnetic field ($L_z$ and $S_z$,
respectively) are constants of motion. On the other hand, the components
orthogonal to the magnetic field, $\boldsymbol{L}_\perp = (L_x, L_y)$ and $\boldsymbol{S}_\perp = (S_x, S_y)$, rotate with the
frequencies $\omega_L$ and $2\omega_L$, respectively.

Since we have neglected the coupling between spin and orbital angular
momentum here, both vectors precess independently around the magnetic field.
The z component of the orbital angular momentum, $L_z$, and that of the spin, $S_z$,
remain, as mentioned above, constant. We should note that the spin rotates
twice as fast as the orbital angular momentum. Taking into account the
corresponding gyromagnetic factors (compare with the section "Doublet Splitting"),
the magnetic moment $\boldsymbol{M}$, given by


$\boldsymbol{M} = \boldsymbol{M}_L + \boldsymbol{M}_S = (\mu_B/\hbar)(\boldsymbol{L} + 2\boldsymbol{S})$ , (18)

behaves analogously. Owing to the absence of LS-coupling, we get for the
z component of $\boldsymbol{M}$ directly


$M_z = (\mu_B/\hbar)(L_z + 2S_z)$ . (19)

[Figure: Precession of the orbital angular momentum and the spin (a), as well as their corresponding magnetic momenta (b), around the magnetic field (z axis)]
†4.12.1978. G. taught from 1928 to 1941 at the University of Michigan in Ann Arbor, and
was a member of the Massachusetts Institute of Technology in Cambridge, Mass. from
1941 to 1946. From 1948 he worked at the Brookhaven National Laboratory in Upton,
From 1924 to 1948, he was the successor of H. Kamerlingh Onnes and together with
W.H. Keesom, was joint director of the low-temperature laboratory in Leiden. There,
ZEEMAN, Pieter, Dutch physicist, *Zonnemaire (at Zierikzee) 25.5.1865, † Amster-
13. A Nonrelativistic Wave Equation with Spin


In this chapter we introduce a new method of deducing, in a systematic,
theoretical manner, the Pauli equation for the electron with the correct g factor.
In contrast to earlier derivations, we do not refer to empirical facts, but develop
the new theoretical concept of the linearization of the wave equation.

What is meant by this will become clear in the next few sections. Conceptually
we are dealing with the same method which will be used later on in
relativistic quantum theory to derive the Dirac equation from the Klein-Gordon
equation. Lévy-Leblond,¹ for example, performed such a linearization for the
Schrödinger equation. Here, we partially follow his argumentation, nevertheless
abandoning it at some points, in order to demonstrate the ideas more easily
and clearly.


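13.1 The Linearization of the Schrödinger Equation


First we abbreviate the Schrödinger operator by

$\hat S = \hat E - \frac{\hat{\boldsymbol{p}}^2}{2m} = i\hbar\,\frac{\partial}{\partial t} + \frac{\hbar^2}{2m}\Delta$ . (13.1)


The free Schrödinger equation then reads

$\hat S\psi = 0$ . (13.2)


It is asymmetric with respect to time ($\partial/\partial t$) and space derivatives ($\partial/\partial x$). This is
because the former appears linearly in $\hat S$, while $\hat S$ is quadratic in $\hat{\boldsymbol{p}}$. To remove
this asymmetry, we try to construct a wave equation of the general form


$\hat\Theta\psi = (\hat A\hat E + \hat{\boldsymbol{B}}\cdot\hat{\boldsymbol{p}} + \hat C)\psi = 0$ . (13.3)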


Here $\hat A$, $\hat{\boldsymbol{B}}$ and $\hat C$ are to be linear operators (matrices) which still have to be
determined, but which no longer depend on $\hat E$ or $\hat{\boldsymbol{p}}$. According to (13.2), we
further require that the solutions $\psi$ of (13.3) simultaneously be solutions of the
Schrödinger equation. This means that the equations


$\hat\Theta\psi = 0$ (13.4a)

and

$\hat S\psi = 0$ (13.4b)

must be simultaneously valid. Then an operator

$\hat\Theta' = \hat A'\hat E + \hat{\boldsymbol{B}}'\cdot\hat{\boldsymbol{p}} + \hat C'$ (13.5)


must exist so that the multiplication of (13.4a) by $\hat\Theta'$ again yields the Schrödinger
equation (13.4b), i.e.


$\hat\Theta'\hat\Theta = 2m\hat S$ . (13.6)


¹ J.M. Lévy-Leblond: Comm. Math. Phys. 6, 286 (1967).


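The factor 2m is actually arbitrary, but will prove useful later. The operators $\hat A'$,
$\hat{\boldsymbol{B}}'$ and $\hat C'$, introduced in (13.5), again shall not contain $\hat E$ and $\hat{\boldsymbol{p}}$. They still have
to be determined, as do $\hat A$, $\hat{\boldsymbol{B}}$ and $\hat C$. If the procedure were to fail, we would be
unable to find such operators. If we are successful
(and indeed, this will become apparent), then the equation $\hat\Theta\psi = 0$ represents
a more or less equivalent wave equation to the Schrödinger equation, but linear
in both $\hat E$ and $\hat{\boldsymbol{p}}$. Then we speak of (13.3) as the linearized Schrödinger equation.

To construct $\hat A$, $\hat A'$, $\hat{\boldsymbol{B}}$, $\hat{\boldsymbol{B}}'$, $\hat C$ and $\hat C'$, we multiply the expressions (13.3) and
(13.5) for $\hat\Theta$ and $\hat\Theta'$ and carry through, according to (13.6), a comparison of
coefficients with $2m\hat S$. We obtain


$\left(\hat A'\hat E + \sum_{i=1}^{3}\hat B'_i\hat p_i + \hat C'\right)\left(\hat A\hat E + \sum_{j=1}^{3}\hat B_j\hat p_j + \hat C\right) = 2m\hat E - \sum_{k=1}^{3}\hat p_k^2$ , (13.7)

and thus

$\hat A'\hat A = 0$ , $\hat A'\hat B_i + \hat B'_i\hat A = 0$ ,
$\hat A'\hat C + \hat C'\hat A = 2m$ , $\hat B'_i\hat B_j + \hat B'_j\hat B_i = -2\delta_{ij}$ ,
$\hat C'\hat C = 0$ , $\hat C'\hat B_i + \hat B'_i\hat C = 0$ (i, j = 1, 2, 3) . (13.8)


To simplify these conditions, we define the new operators


$\hat B_4 = i\left(\hat A + \frac{1}{2m}\hat C\right)$ , $\hat B'_4 = i\left(\hat A' + \frac{1}{2m}\hat C'\right)$ ,
$\hat B_5 = \hat A - \frac{1}{2m}\hat C$ , $\hat B'_5 = \hat A' - \frac{1}{2m}\hat C'$ . (13.9)

So (13.8) becomes

$\hat B'_\mu\hat B_\nu + \hat B'_\nu\hat B_\mu = -2\delta_{\mu\nu}$ , ($\mu, \nu$ = 1 to 5) . (13.10)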


These relations can still be changed into another form which is more customary
in relativistic quantum mechanics. Let $\hat M$ be a nonsingular, arbitrary operator
(with $\hat M\hat M^{-1} = 1$). Then we choose

$\hat B_\alpha = \hat M\hat\gamma_\alpha$ , $\hat B'_\alpha = -\hat\gamma_\alpha\hat M^{-1}$ ($\alpha$ = 1, ..., 4) ,
$\hat B_5 = -i\hat M$ , $\hat B'_5 = -i\hat M^{-1}$ . (13.11)


The following anticommutation relations follow by insertion of (13.11) into
(13.10):

$\hat\gamma_\alpha\hat\gamma_\beta + \hat\gamma_\beta\hat\gamma_\alpha = 2\delta_{\alpha\beta}$ , ($\alpha, \beta$ = 1, ..., 4) . (13.12)


We should note that these relations are valid only for four operators $\hat\gamma_\alpha$, while
in (13.10) there are five operators $\hat B_\mu$ present. It can easily be seen that the
definitions (13.11) automatically fulfil the anticommutation relations (13.10) for
the case that one or both indices are equal to 5 (i.e. $\mu = 5$ or $\nu = 5$, or $\mu = \nu = 5$).
For example, we calculate


$\hat B'_5\hat B_\alpha + \hat B'_\alpha\hat B_5 = -i\hat M^{-1}\hat M\hat\gamma_\alpha - \hat\gamma_\alpha\hat M^{-1}(-i)\hat M = -i\hat\gamma_\alpha + i\hat\gamma_\alpha = 0$ for $\alpha$ = 1, 2, 3, 4 , (13.13)

and

$\hat B'_5\hat B_5 + \hat B'_5\hat B_5 = (-i)\hat M^{-1}(-i)\hat M + (-i)\hat M^{-1}(-i)\hat M = 2i^2 = -2$ . (13.14)


So the five operators $\hat B_\mu$ will be replaced by the four $\hat\gamma_\alpha$ and the arbitrarily chosen
operator $\hat M$ (which indeed must not be singular because it must have an inverse
operator $\hat M^{-1}$).

The anticommutation relations (13.12) define an algebra, which is known in 
the literature as Clifford algebra. It can be represented by matrices and leads to 
the algebra of the complex 4 x 4 matrices (of particular importance in relativistic 
quantum theory) as a special representation. 

In order to obtain an explicit representation for the $\hat\gamma_\alpha$, and thus for the $\hat B_\mu$,
we observe (13.12) more carefully, and immediately verify that²


$(\gamma_\alpha)^2 = 1$ , (13.15)
$\gamma_\alpha\gamma_\beta = -\gamma_\beta\gamma_\alpha$ for $\alpha \neq \beta$ (13.16)


must hold. In other words, the squares of the $\gamma_\alpha$ are 1 and the different $\gamma$
operators anticommute.

It follows from the former that the eigenvalues of $\gamma_\alpha$ have to be ±1. Matrices
representing the $\gamma$'s have to be square according to (13.15). And from (13.16),
it follows that the traces of these matrices have to vanish, because for $\alpha \neq \beta$,
we have


$\mathrm{tr}\,\gamma_\alpha = \mathrm{tr}\,\gamma_\alpha\gamma_\beta\gamma_\beta = -\mathrm{tr}\,\gamma_\beta\gamma_\alpha\gamma_\beta = -\mathrm{tr}\,\gamma_\alpha\gamma_\beta\gamma_\beta = -\mathrm{tr}\,\gamma_\alpha$ , (13.17)

and thus trace $\gamma_\alpha = 0$. Here we have made use of trace AB
$= \sum_{i,k} A_{ik}B_{ki}$ = trace BA and the fact that $\gamma_\beta^2 = 1$, according to (13.15). As the
trace is just the sum over the eigenvalues, the numbers of positive and negative
eigenvalues have to be equal. Therefore the $\gamma$ matrices have to be of even
dimension. The smallest even dimension, N = 2, has to be excluded, because in
2 × 2-matrix space, there is only room for 3 anticommuting matrices $\hat\sigma_i$ and the
unit matrix. The $\hat\sigma_i$, i = 1, 2, 3, are the well-known Pauli matrices, which
anticommute according to (12.21). (In Exercise 13.1 we will show that the three
$\hat\sigma_i$ and the 2 × 2 unit matrix 1 completely span the space of 2 × 2 matrices.)


² From now on, we shall omit the operator sign on the $\gamma_\alpha$ operators, keeping in mind,
however, their operator character.

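Thus we conclude that the smallest dimension under which the conditions
listed above on the 4 anticommuting matrices $\gamma_\alpha$ can be fulfilled is N = 4.
Because of the properties of the Pauli matrices, described in Chap. 12, it is not
difficult to give the following representation for the $\gamma_\alpha$:


$\gamma_i = \begin{pmatrix}0 & \hat\sigma_i\\ \hat\sigma_i & 0\end{pmatrix}$ (i = 1, 2, 3) , $\gamma_4 = \begin{pmatrix}1 & 0\\ 0 & -1\end{pmatrix}$ . (13.18)


Here, 0, 1 and the $\hat\sigma_i$ indicate 2 × 2 submatrices; thus (13.18) is an abbreviation.
Explicitly, (13.18) reads:


$\gamma_1 = \begin{pmatrix}0&0&0&1\\0&0&1&0\\0&1&0&0\\1&0&0&0\end{pmatrix}$ , $\gamma_2 = \begin{pmatrix}0&0&0&-i\\0&0&i&0\\0&-i&0&0\\i&0&0&0\end{pmatrix}$ ,

$\gamma_3 = \begin{pmatrix}0&0&1&0\\0&0&0&-1\\1&0&0&0\\0&-1&0&0\end{pmatrix}$ , $\gamma_4 = \begin{pmatrix}1&0&0&0\\0&1&0&0\\0&0&-1&0\\0&0&0&-1\end{pmatrix}$ . (13.19)


The validity of relations (13.15) and (13.16) can easily be checked. For example


$\gamma_i\gamma_j + \gamma_j\gamma_i = \begin{pmatrix}0 & \hat\sigma_i\\ \hat\sigma_i & 0\end{pmatrix}\begin{pmatrix}0 & \hat\sigma_j\\ \hat\sigma_j & 0\end{pmatrix} + \begin{pmatrix}0 & \hat\sigma_j\\ \hat\sigma_j & 0\end{pmatrix}\begin{pmatrix}0 & \hat\sigma_i\\ \hat\sigma_i & 0\end{pmatrix}$
$= \begin{pmatrix}\hat\sigma_i\hat\sigma_j + \hat\sigma_j\hat\sigma_i & 0\\ 0 & \hat\sigma_i\hat\sigma_j + \hat\sigma_j\hat\sigma_i\end{pmatrix} = \begin{pmatrix}2\delta_{ij} & 0\\ 0 & 2\delta_{ij}\end{pmatrix} = 2\delta_{ij}\begin{pmatrix}1 & 0\\ 0 & 1\end{pmatrix}$ , for i, j = 1, 2, 3 , (13.20)

$\gamma_i\gamma_4 + \gamma_4\gamma_i = \begin{pmatrix}0 & \hat\sigma_i\\ \hat\sigma_i & 0\end{pmatrix}\begin{pmatrix}1 & 0\\ 0 & -1\end{pmatrix} + \begin{pmatrix}1 & 0\\ 0 & -1\end{pmatrix}\begin{pmatrix}0 & \hat\sigma_i\\ \hat\sigma_i & 0\end{pmatrix} = \begin{pmatrix}0 & -\hat\sigma_i\\ \hat\sigma_i & 0\end{pmatrix} + \begin{pmatrix}0 & \hat\sigma_i\\ -\hat\sigma_i & 0\end{pmatrix} = 0$ . (13.21)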
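To obtain a matrix representation for the $\hat B_\mu$ in accordance with (13.11), we
choose


$\hat M = \begin{pmatrix}0 & 1\\ 1 & 0\end{pmatrix}$ .

The relation


$\hat M^{-1} = \begin{pmatrix}0 & 1\\ 1 & 0\end{pmatrix} = \hat M$ (13.22)


is obvious. We continue calculating:


$\hat B_i = \hat M\gamma_i = \begin{pmatrix}0 & 1\\ 1 & 0\end{pmatrix}\begin{pmatrix}0 & \hat\sigma_i\\ \hat\sigma_i & 0\end{pmatrix} = \begin{pmatrix}\hat\sigma_i & 0\\ 0 & \hat\sigma_i\end{pmatrix}$ for i = 1, 2, 3 ,

$\hat B_4 = \hat M\gamma_4 = \begin{pmatrix}0 & 1\\ 1 & 0\end{pmatrix}\begin{pmatrix}1 & 0\\ 0 & -1\end{pmatrix} = \begin{pmatrix}0 & -1\\ 1 & 0\end{pmatrix}$ ,

$\hat B_5 = -i\hat M = -i\begin{pmatrix}0 & 1\\ 1 & 0\end{pmatrix}$ . (13.23)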

As was just mentioned, the Pauli matrices $\hat\sigma_i$ and the 2 × 2 unit matrix 1
completely span the space of the 2 × 2 matrices. This means that any arbitrary 2 × 2
matrix can be expressed by the $\hat\sigma_i$ and 1. We will show this in the following
exercise.


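EXERCISE


13.1 Completeness of the Pauli Matrices

Problem. Show that every 2 × 2 matrix $\begin{pmatrix}u_{11} & u_{12}\\ u_{21} & u_{22}\end{pmatrix}$ can be expressed by 1 and $\hat\sigma_i$.


Solution. First we write down the proposition

$\begin{pmatrix}u_{11} & u_{12}\\ u_{21} & u_{22}\end{pmatrix} = a_1\hat\sigma_1 + a_2\hat\sigma_2 + a_3\hat\sigma_3 + a_4 1$
$= a_1\begin{pmatrix}0 & 1\\ 1 & 0\end{pmatrix} + a_2\begin{pmatrix}0 & -i\\ i & 0\end{pmatrix} + a_3\begin{pmatrix}1 & 0\\ 0 & -1\end{pmatrix} + a_4\begin{pmatrix}1 & 0\\ 0 & 1\end{pmatrix}$
$= \begin{pmatrix}(a_3 + a_4) & (a_1 - ia_2)\\ (a_1 + ia_2) & (-a_3 + a_4)\end{pmatrix}$ . (1)


Both matrices have to be equal in each element. Then we get the following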

Both matrices have to be equal in each element. Then we get the following 
system of equations:

$u_{11} = 0a_1 + 0a_2 + a_3 + a_4$ ,
$u_{12} = a_1 - ia_2 + 0a_3 + 0a_4$ ,
$u_{21} = a_1 + ia_2 + 0a_3 + 0a_4$ ,
$u_{22} = 0a_1 + 0a_2 - a_3 + a_4$ , (2)


with the determinant of coefficients


$\begin{vmatrix}0 & 0 & 1 & 1\\ 1 & -i & 0 & 0\\ 1 & i & 0 & 0\\ 0 & 0 & -1 & 1\end{vmatrix} = 4i \neq 0$ , (3)


which is always nonzero. Hence, the system (2) has a unique solution
$a_1, \ldots, a_4$ for every given matrix, which proves the proposition; $\hat\sigma_i$ and 1 span
the whole space of 2 × 2 matrices!
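
The system (2) can also be solved in closed form, which gives a direct recipe for the expansion coefficients: $a_1 = (u_{12} + u_{21})/2$, $a_2 = i(u_{12} - u_{21})/2$, $a_3 = (u_{11} - u_{22})/2$, $a_4 = (u_{11} + u_{22})/2$. The short sketch below, with illustrative function names and a randomly chosen test matrix, applies this recipe and rebuilds the matrix from its coefficients.

```python
import numpy as np

PAULI = [np.array([[0, 1], [1, 0]], dtype=complex),
         np.array([[0, -1j], [1j, 0]], dtype=complex),
         np.array([[1, 0], [0, -1]], dtype=complex)]
IDENTITY = np.eye(2, dtype=complex)


def pauli_coefficients(u):
    """Closed-form solution of system (2): returns (a1, a2, a3, a4)."""
    a1 = (u[0, 1] + u[1, 0]) / 2
    a2 = 1j * (u[0, 1] - u[1, 0]) / 2
    a3 = (u[0, 0] - u[1, 1]) / 2
    a4 = (u[0, 0] + u[1, 1]) / 2
    return a1, a2, a3, a4


def rebuild(coeffs):
    """Evaluate a1*sigma_1 + a2*sigma_2 + a3*sigma_3 + a4*1, cf. (1)."""
    a1, a2, a3, a4 = coeffs
    return a1 * PAULI[0] + a2 * PAULI[1] + a3 * PAULI[2] + a4 * IDENTITY


# Coefficient matrix of system (2) and its determinant, cf. (3).
coefficient_matrix = np.array([[0, 0, 1, 1],
                               [1, -1j, 0, 0],
                               [1, 1j, 0, 0],
                               [0, 0, -1, 1]], dtype=complex)
determinant = np.linalg.det(coefficient_matrix)

# Arbitrary complex test matrix (seeded for reproducibility).
rng = np.random.default_rng(0)
u = rng.normal(size=(2, 2)) + 1j * rng.normal(size=(2, 2))
reconstruction_error = np.max(np.abs(rebuild(pauli_coefficients(u)) - u))
print(determinant, reconstruction_error)
```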


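EXERCISE


13.2 A Computation Rule for Pauli Matrices


Problem. Let A and B be arbitrary vectors. Prove the relation


$(\hat{\boldsymbol{\sigma}}\cdot\boldsymbol{A})(\hat{\boldsymbol{\sigma}}\cdot\boldsymbol{B}) = \boldsymbol{A}\cdot\boldsymbol{B} + i\hat{\boldsymbol{\sigma}}\cdot(\boldsymbol{A}\times\boldsymbol{B})$ . (1)


Solution. The product rule for the $\hat\sigma_i$ is

$\hat\sigma_i\hat\sigma_j = i\varepsilon_{ijk}\hat\sigma_k + \delta_{ij}$ , where

$\varepsilon_{ijk} = \begin{cases} 1 & \text{even permutation of 1, 2, 3}\\ -1 & \text{odd permutation of 1, 2, 3}\\ 0 & \text{otherwise}\end{cases}$ (2)

and, for $i \neq j$, k denotes the remaining third index.
Addition (or subtraction) then gives

$\hat\sigma_i\hat\sigma_j - \hat\sigma_j\hat\sigma_i = 2i\varepsilon_{ijk}\hat\sigma_k$ ,
$\hat\sigma_i\hat\sigma_j + \hat\sigma_j\hat\sigma_i = 2\delta_{ij}$ . (3)

We write out the scalar product

$(\hat{\boldsymbol{\sigma}}\cdot\boldsymbol{A})(\hat{\boldsymbol{\sigma}}\cdot\boldsymbol{B}) = \left(\sum_{i=1}^{3}\hat\sigma_i A_i\right)\left(\sum_{j=1}^{3}\hat\sigma_j B_j\right)$ . (4)

For the individual components we can write


$\hat\sigma_i A_i\hat\sigma_j B_j = A_i B_j\left(i\varepsilon_{ijk}\hat\sigma_k + \delta_{ij}\right)$ , (5)

and

$\sum_{i,j} A_i B_j\delta_{ij} = \sum_i A_i B_i$


is just the scalar product $\boldsymbol{A}\cdot\boldsymbol{B}$. In the first term, the sum can be expanded over
k without making any changes:


$\sum_{i,j}\varepsilon_{ijk}A_i B_j\hat\sigma_k = \sum_{i,j,k}\varepsilon_{ijk}A_i B_j\hat\sigma_k$ , (6)

because, for example, for i, j = 1, 2, k has to be equal to 3 and the additional
terms of the supplementary summation over k with k = 1, 2 vanish identically.


Now $\sum_{i,j}\varepsilon_{ijk}A_i B_j$ are just the components of the vector product $\boldsymbol{A}\times\boldsymbol{B}$. Therefore
we have


$\sum_{i,j,k}\varepsilon_{ijk}A_i B_j\hat\sigma_k = \sum_k(\boldsymbol{A}\times\boldsymbol{B})_k\hat\sigma_k = \hat{\boldsymbol{\sigma}}\cdot(\boldsymbol{A}\times\boldsymbol{B})$ . (7)


Altogether we get

$(\hat{\boldsymbol{\sigma}}\cdot\boldsymbol{A})(\hat{\boldsymbol{\sigma}}\cdot\boldsymbol{B}) = \boldsymbol{A}\cdot\boldsymbol{B} + i\hat{\boldsymbol{\sigma}}\cdot(\boldsymbol{A}\times\boldsymbol{B})$ .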
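
As a plausibility check, the relation can be tested numerically for ordinary (commuting) real vectors. The sketch below uses the standard Pauli matrices; the helper names and the two sample vectors are illustrative assumptions. Note that the check does not cover operator-valued vectors, for which the order of the factors matters.

```python
import numpy as np

PAULI = np.array([
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
], dtype=complex)


def sigma_dot(v):
    """Return the 2x2 matrix sigma . v = sum_i sigma_i v_i."""
    return np.einsum('i,ijk->jk', v, PAULI)


def identity_residual(a, b):
    """Largest element of (sigma.a)(sigma.b) - [a.b + i sigma.(a x b)]."""
    lhs = sigma_dot(a) @ sigma_dot(b)
    rhs = np.dot(a, b) * np.eye(2) + 1j * sigma_dot(np.cross(a, b))
    return np.max(np.abs(lhs - rhs))


# Hypothetical sample vectors.
vec_a = np.array([0.3, -1.2, 2.0])
vec_b = np.array([1.5, 0.4, -0.7])
residual = identity_residual(vec_a, vec_b)
print("residual:", residual)
```

For A = B the cross product vanishes, and the relation reduces to $(\hat{\boldsymbol{\sigma}}\cdot\boldsymbol{A})^2 = \boldsymbol{A}^2$.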


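It now follows from (13.3) that the wave function


$\psi = \begin{pmatrix}\varphi\\ \chi\end{pmatrix} = \begin{pmatrix}\varphi_1\\ \varphi_2\\ \chi_1\\ \chi_2\end{pmatrix}$ (13.24)


must have four components, because $\hat A$, $\hat{\boldsymbol{B}}$ and $\hat C$ are 4 × 4 matrices. Here,
$\varphi = \begin{pmatrix}\varphi_1\\ \varphi_2\end{pmatrix}$ and $\chi = \begin{pmatrix}\chi_1\\ \chi_2\end{pmatrix}$ are two-component spinors, which together form the
four-component spinor $\psi$.

We now solve (13.9) for $\hat A$ and $\hat C$:


$\hat A = \frac{1}{2}(\hat B_5 - i\hat B_4)$ , $\hat C = -m(\hat B_5 + i\hat B_4)$ , (13.25)

and thus

$\hat A = -i\begin{pmatrix}0 & 0\\ 1 & 0\end{pmatrix}$ and $\hat C = 2mi\begin{pmatrix}0 & 1\\ 0 & 0\end{pmatrix}$ . (13.26)


In the next step, the matrices $\hat A$, $\hat{\boldsymbol{B}}$ and $\hat C$ are inserted into the equation of motion
(13.3), giving


$\left[-i\begin{pmatrix}0 & 0\\ 1 & 0\end{pmatrix}\hat E + \begin{pmatrix}\hat{\boldsymbol{\sigma}}\cdot\hat{\boldsymbol{p}} & 0\\ 0 & \hat{\boldsymbol{\sigma}}\cdot\hat{\boldsymbol{p}}\end{pmatrix} + 2mi\begin{pmatrix}0 & 1\\ 0 & 0\end{pmatrix}\right]\begin{pmatrix}\varphi\\ \chi\end{pmatrix} = 0$ . (13.27)


Writing this matrix equation by components, we obtain the coupled system of
equations for the two-component spinors $\chi$ and $\varphi$:

$\hat{\boldsymbol{\sigma}}\cdot\hat{\boldsymbol{p}}\,\varphi + 2mi\chi = 0$ , $\hat{\boldsymbol{\sigma}}\cdot\hat{\boldsymbol{p}}\,\chi - i\hat E\varphi = 0$ , (13.28)


where $\hat{\boldsymbol{\sigma}}$ is the vector with the components $\hat\sigma_i$: $\hat{\boldsymbol{\sigma}} = \{\hat\sigma_1, \hat\sigma_2, \hat\sigma_3\}$.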


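EXERCISE


13.3 Spinors Satisfying the Schrödinger Equation


Problem. Show that the two-spinors $\varphi$ and $\chi$ satisfy the ordinary Schrödinger
equation.


Solution. We first eliminate $\chi = -(\hat{\boldsymbol{\sigma}}\cdot\hat{\boldsymbol{p}}/2mi)\varphi$ and get from (13.28) that

$\left[-i\hat E - \frac{(\hat{\boldsymbol{\sigma}}\cdot\hat{\boldsymbol{p}})(\hat{\boldsymbol{\sigma}}\cdot\hat{\boldsymbol{p}})}{2mi}\right]\varphi = 0$ . (1)


Since $(\hat{\boldsymbol{\sigma}}\cdot\hat{\boldsymbol{p}})(\hat{\boldsymbol{\sigma}}\cdot\hat{\boldsymbol{p}}) = \hat{\boldsymbol{p}}^2$, we obtain


$\left(\hat E - \frac{\hat{\boldsymbol{p}}^2}{2m}\right)\varphi = 0$ . (2)


This is the Schrödinger equation for $\varphi$.

Now we eliminate $\hat E\varphi = (\hat{\boldsymbol{\sigma}}\cdot\hat{\boldsymbol{p}})\chi/i$, taken from the second equation in (13.28).
Multiplying the first equation in (13.28) by $\hat E$ and inserting this expression
yields


$\left[(1/i)(\hat{\boldsymbol{\sigma}}\cdot\hat{\boldsymbol{p}})(\hat{\boldsymbol{\sigma}}\cdot\hat{\boldsymbol{p}}) + 2mi\hat E\right]\chi = 0$ or


$\left(\hat E - \frac{\hat{\boldsymbol{p}}^2}{2m}\right)\chi = 0$ . (3)


Therefore $\chi$ also satisfies the Schrödinger equation, as was to be shown.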


In Exercise 13.3 we show that the four-spinor $\psi = \begin{pmatrix}\varphi\\ \chi\end{pmatrix}$ of the linearized
Schrödinger equation (13.3, 28) indeed satisfies the ordinary Schrödinger
equation, as we required. Therefore the energy eigenvalues are in both cases also
$E = p^2/2m$. After eliminating $\chi$, the corresponding eigenvectors take the form


$\psi = \begin{pmatrix}\varphi\\ -\dfrac{\hat{\boldsymbol{\sigma}}\cdot\hat{\boldsymbol{p}}}{2mi}\,\varphi\end{pmatrix}$ . (13.29)


Here, it would seem that the wave function $\psi$ with the lower component
$\chi$ contains redundant information. That this is not valid in general will now be
demonstrated by considering the coupling with an external electromagnetic
field.
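13.2 Particles in an External Field
and the Magnetic Moment


The gauge invariance of the Schrödinger equation requires the substitution
(minimal coupling, see Chap. 9)


$i\hbar\,\frac{\partial}{\partial t} \to i\hbar\,\frac{\partial}{\partial t} - eV(\boldsymbol{r}, t)$ and $-i\hbar\nabla \to -i\hbar\nabla - \frac{e}{c}\boldsymbol{A}(\boldsymbol{r}, t)$ , (13.30)

or, in Lorentz covariant notation,

$-i\hbar\,\frac{\partial}{\partial x_\mu} \to -i\hbar\,\frac{\partial}{\partial x_\mu} - \frac{e}{c}A_\mu$ , $\mu$ = 1, 2, 3, 4 , (13.31)


where the four-potential is given by

$A = \{A_\mu\} = \{\boldsymbol{A}, iV\}$ . (13.32)


Here, e is the electric charge of the particle, $V(\boldsymbol{r}, t)$ is the Coulomb potential and
$\boldsymbol{A}(\boldsymbol{r}, t)$ is the vector potential. Let us remember the essential argument. A gauge
transformation is described by


$A'_\mu = A_\mu + \frac{\partial f}{\partial x_\mu}$ (13.33)


with an arbitrary function $f(x_\mu)$. The minimal coupling (13.30), together with the
phase transformation for the wave function


$\psi' = \psi\exp\left[-\frac{e}{i\hbar c}f\right]$ , (13.34)

then lead to

$\left(-i\hbar\,\frac{\partial}{\partial x_\mu} - \frac{e}{c}A'_\mu\right)\psi' = \left(-i\hbar\,\frac{\partial}{\partial x_\mu} - \frac{e}{c}A_\mu - \frac{e}{c}\frac{\partial f}{\partial x_\mu}\right)\psi\exp[-(e/i\hbar c)f]$

$= \left[\left(-i\hbar\,\frac{\partial}{\partial x_\mu} - \frac{e}{c}A_\mu\right)\psi\right]\exp[-(e/i\hbar c)f] + \left(\frac{e}{c}\frac{\partial f}{\partial x_\mu} - \frac{e}{c}\frac{\partial f}{\partial x_\mu}\right)\psi\exp[-(e/i\hbar c)f]$

$= \left[\left(-i\hbar\,\frac{\partial}{\partial x_\mu} - \frac{e}{c}A_\mu\right)\psi\right]\exp[-(e/i\hbar c)f]$ . (13.35)


This means that a gauge transformation can be absorbed with the
state-independent phase $\exp[-(e/i\hbar c)f(\boldsymbol{r}, t)]$ and therefore does not change the
physics (matrix elements, expectation values etc.). So the minimal coupling
(13.30) leads to gauge-invariant quantum theories.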

With (13.30), the free equations of motion (13.28) become 


+-(p ~£4)) _i(E—eV)9 =0 


Sa 
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se SE EE ee 


#-(0 = <4) + iQmy=0 . (13.36) 


Again we eliminate y = — [6-(p — eA/c)/2mi]@ and get 


Z 1 e e 
Ae a pe alee = , 13.37 
| HE) ale (7 : Ja (6 z ) |e ( ) 
|e —ev—L4-(0~£4)e-(p—£4) |= (13.38) 
2m € Cc 


Using once more the identity 


(6-nx)(6-n) =n? +i6-(nxx) , (13.39) 


we thus obtain 


(13.40) 
The last term reduces to 
(p-£4)x(a—£4) = ~<( pxA+Axp) 
€ c c 
e re ; Cys 
an px AXP A De 
(13.41) 
so that at last (13.38) can be written as 
= (Mp ela Ne) aenlc ona a 
E—eV—-—|p—-A) + =—6é6:-(pxA) |9=0 . (13.42) 
2m Cc 2mc 


Now p = —ihV and B= Vx A. Hence, (13.42) becomes 
5 1 i g h 
bey = |p een ean oro me (13.43) 
2m € 2mc 


This is just the well-known Pauli equation! — See Chap. 12. 
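The last term in the equation of motion (13.43) is the interaction energy of the magnetic field with the intrinsic magnetic moment of the particle

$$ \hat{\boldsymbol\mu} = \frac{e\hbar}{2mc}\hat{\boldsymbol\sigma} , \tag{13.44} $$

or, because the spin operator of the particle is $\hat{\mathbf S} = (1/2)\hat{\boldsymbol\sigma}$,

$$ \hat{\boldsymbol\mu} = \frac{e\hbar}{mc}\hat{\mathbf S} = g_{\rm spin}\,\mu_B\,\hat{\mathbf S} = 2\mu_B\hat{\mathbf S} . \tag{13.45} $$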
The factor $g_{\rm spin}$ is called the gyromagnetic ratio or gyromagnetic factor and turns out to be twice as large as that coming from the orbital motion. The ratio $g_{\rm spin}/g_{\rm orbit}$ is called the spin-Landé factor $g_s$. For the particle in question, $g_s$ is therefore 2.


Thus a completely nonrelativistic linearized theory predicts the correct intrinsic magnetic moment of a spin-$\frac{1}{2}$ particle.


In contrast to this, almost all textbooks falsely claim that the anomalous 
magnetic moment is due to relativistic properties. The existence of spin is 
therefore not a relativistic effect, as is often asserted, but is a consequence of the 
linearization of the wave equations. This can be philosophically expressed as 
follows: obviously, the good Lord wrote the field equations in linearized form, 
i.e. in the nonrelativistic case, as a system of two coupled differential equations of 
first order, and then coupled the electromagnetic field minimally. He did not 
write them as a differential equation of second order (the Schrödinger equation).

We have successfully derived the Pauli equation from the Schrödinger equation. Whereas in the heuristic derivation of the Pauli equation presented in Chap. 12 the spin degree of freedom was introduced "ad hoc", in the derivation presented here we have only postulated the linearization of the equations of motion. Everything else that followed was just a consequence of this postulate.

14. Elementary Aspects of the Quantum-Mechanical Many-Body Problem


If we consider a system of more than one particle, we derive the Hamiltonian that describes it quantum mechanically in the usual manner from the Hamiltonian function of the system in classical mechanics. The Hamiltonian function

$$ H = \sum_{i=1}^{N}\left(\frac{\mathbf p_i^2}{2m_i} + V_i(\mathbf r_i, t)\right) + \sum_{i\neq k} V_{ik}(\mathbf r_i, \mathbf r_k) \tag{14.1} $$
describes a system of $N$ particles with masses $m_i$. Here, $V_i(\mathbf r_i, t)$ is the externally given potential (the so-called one-particle potential) in which the $i$th particle moves; it can, for example, be the external electric potential. $V_{ik}(\mathbf r_i, \mathbf r_k)$ stands for the interaction potential between two particles $i$ and $k$; it can, for example, be their mutual Coulomb interaction. To get the Hamiltonian, we replace the momenta by the corresponding differential operators

$$ \mathbf p_i \to \hat{\mathbf p}_i = \frac{\hbar}{i}\nabla_i , \tag{14.2} $$


where the index $i$ of the nabla operator specifies that the gradient has to be determined at the location of particle $i$, i.e. $\nabla_i$ only acts on the coordinates of the $i$th particle. Consequently the momentum operators of different particles commute, i.e. $[\hat{\mathbf p}_i, \hat{\mathbf p}_j]_- = 0$ for all $i, j$. Thus the many-particle Hamiltonian reads

$$ \hat H = \sum_{i=1}^{N}\left(-\frac{\hbar^2}{2m_i}\nabla_i^2 + V_i(\mathbf r_i, t)\right) + \sum_{i\neq k} V_{ik}(\mathbf r_i, \mathbf r_k) . \tag{14.3} $$


This is obviously a generalization of the Hamiltonian for one particle. We can now formulate a many-particle Schrödinger equation

$$ \hat H\psi = i\hbar\frac{\partial\psi}{\partial t} , $$

where the wave function now depends on the $3N$ coordinates of all particles and on time:

$$ \psi = \psi(x_1, y_1, z_1, \ldots, x_N, y_N, z_N, t) . \tag{14.4} $$
The treatment of this many-body problem confronts the same difficulties in quantum mechanics as in classical physics because of the complexity compared to the one-particle problem.

The wave equation is defined in a space with $3N$ dimensions, in the so-called configuration space of the system. The name of this fictitious space originates from the fact that the specification of the coordinates of a particular point in this space means the specification of the three-dimensional position coordinates $(x_k, y_k, z_k)$ for all particles of the system ($k = 1, 2, \ldots, N$), and thus determines the state (configuration) of all particles in three-dimensional space. Therefore a point in configuration space with $3N$ coordinates $(x_1, y_1, z_1, \ldots, x_N, y_N, z_N)$ is also called the configuration point of the system.

We denote an infinitesimally small volume element in the configuration space by $dV$:

$$ dV = dV_1\,dV_2 \ldots dV_k \ldots dV_N = dx_1\,dy_1\,dz_1\,dx_2\,dy_2\,dz_2 \ldots dx_N\,dy_N\,dz_N . \tag{14.5} $$

Then the quantity

$$ w(x_1, y_1, z_1, \ldots, x_N, y_N, z_N, t)\,dV = \psi^*\psi\,dV \tag{14.6} $$


is the probability that the system can be found at time $t$ in the volume element $dV$ of configuration space. This means $w$ is the probability density of the configuration of the system in which, at time $t$, the coordinates of the first particle lie between $x_1, x_1 + dx_1$; $y_1, y_1 + dy_1$; $z_1, z_1 + dz_1$, and those of the $k$th particle between $x_k, x_k + dx_k$; $y_k, y_k + dy_k$; $z_k, z_k + dz_k$; etc. Besides the volume element $dV$, we also examine the volume elements in subspaces of the kind $d\Omega_k$, $d\Omega_{kj}$, ... etc., which are defined by

$$ dV = dx_k\,dy_k\,dz_k\,d\Omega_k = dV_k\,d\Omega_k , $$
$$ dV = dx_k\,dy_k\,dz_k\,dx_j\,dy_j\,dz_j\,d\Omega_{kj} = dV_k\,dV_j\,d\Omega_{kj} , \quad \text{etc.} \tag{14.7} $$


Integrating (14.6) with respect to the coordinates of all particles except particle $k$, i.e. over $d\Omega_k$, we find the probability that the coordinates of the $k$th particle lie between $x_k, x_k + dx_k$; $y_k, y_k + dy_k$; $z_k, z_k + dz_k$ while all other particles are in arbitrary positions. In other words, we find the probability that the $k$th particle is near a given position in space. Denoting this probability by $w(x_k, y_k, z_k, t)\,dx_k\,dy_k\,dz_k$, we obtain

$$ w(x_k, y_k, z_k, t)\,dx_k\,dy_k\,dz_k = dx_k\,dy_k\,dz_k \int \psi^*\psi\,d\Omega_k . \tag{14.8} $$

In a similar way, the quantity

$$ w(x_k, y_k, z_k, x_j, y_j, z_j, t)\,dx_k\,dy_k\,dz_k\,dx_j\,dy_j\,dz_j = dx_k\,dy_k\,dz_k\,dx_j\,dy_j\,dz_j \int \psi^*\psi\,d\Omega_{kj} \tag{14.9} $$


is the probability that the $k$th particle lies at the point $x_k, y_k, z_k$ and the $j$th one, simultaneously, at the point $x_j, y_j, z_j$. If we know the wave function $\psi$ expressed in configuration space, we thus can determine the probability of a given configuration (14.6) of the system, the probability of the position of any given particle (14.8) and, finally, the probability of the position of any given pair of particles (14.9) etc. In the same manner, the probabilities for the value of an arbitrary quantity can be calculated according to the general formulae of quantum mechanics by expanding $\psi$ in terms of eigenfunctions of any operator of interest to us.
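
To make the reduction from the configuration-space density (14.6) to a one-particle density (14.8) concrete, here is a minimal sketch for two particles in one dimension. The correlated Gaussian wave function and the grid parameters are assumptions made only for this illustration; the integral over $d\Omega_k$ is approximated by a simple Riemann sum.

```python
import math


def psi(x1, x2):
    """Unnormalised, real two-particle wave function (hypothetical example)."""
    return math.exp(-0.5 * (x1 * x1 + x2 * x2 + x1 * x2))


def grid(lo=-8.0, hi=8.0, n=161):
    """Equidistant grid points and the grid spacing."""
    step = (hi - lo) / (n - 1)
    return [lo + i * step for i in range(n)], step


def marginal_density():
    """Return grid, spacing and w(x1) = integral of |psi|^2 over x2, normalised."""
    xs, h = grid()
    dens = [[psi(a, b) ** 2 for b in xs] for a in xs]
    norm = sum(sum(row) for row in dens) * h * h   # total probability before normalising
    w1 = [sum(row) * h / norm for row in dens]     # integrate out the second particle
    return xs, h, w1
```

Integrating the resulting one-particle density over the remaining coordinate gives unity, as it must for a probability density.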




We assume that the wave function $\psi(x_1, \ldots, z_N, t)$, like the wave function for one particle, satisfies the Schrödinger equation

$$ i\hbar\frac{\partial\psi}{\partial t} = \hat H\psi , \tag{14.10} $$


where $\hat H$ is the Hamiltonian (14.3) of the particle system. As stated earlier in (14.1), the classical Hamiltonian function for a system of $N$ particles with masses $m_1, \ldots, m_k, \ldots, m_N$ is

$$ H = \sum_{i=1}^{N}\left(\frac{\mathbf p_i^2}{2m_i} + V_i(x_i, y_i, z_i, t)\right) + \sum_{i\neq k=1}^{N} V_{ik}(x_i, y_i, z_i, x_k, y_k, z_k) , $$

where $V_i(x_i, y_i, z_i, t)$, as just mentioned, is the potential energy of particle $i$ in the external field and $V_{ik}(x_i, \ldots, z_k)$ is the interaction energy between the particles $i$ and $k$. In analogy to this, the Hamiltonian takes the following form:

$$ \hat H = \sum_{i=1}^{N}\left(-\frac{\hbar^2}{2m_i}\nabla_i^2 + V_i(x_i, y_i, z_i, t)\right) + \sum_{i\neq k=1}^{N} V_{ik}(x_i, y_i, z_i, x_k, y_k, z_k) , \tag{14.11} $$

where

$$ \nabla_i^2 = \frac{\partial^2}{\partial x_i^2} + \frac{\partial^2}{\partial y_i^2} + \frac{\partial^2}{\partial z_i^2} $$

is the Laplace operator acting on the $i$th particle. The Hamiltonian operator can also be written down in the presence of a magnetic field and spin. It is equal to the sum of the Hamiltonians of the single particles plus the terms which determine the mutual interaction.

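From (14.10) we can get the equation of continuity for the probability $w$ in configuration space. To find it, we multiply (14.10) by $\psi^*$ and subtract the corresponding complex-conjugated equation. Taking into account the structure of the Hamiltonian (14.11), we get

$$ i\hbar\frac{\partial}{\partial t}(\psi^*\psi) = -\sum_{i=1}^{N}\frac{\hbar^2}{2m_i}\left(\psi^*\nabla_i^2\psi - \psi\nabla_i^2\psi^*\right) = -\sum_{i=1}^{N}\frac{\hbar^2}{2m_i}\nabla_i\cdot\left(\psi^*\nabla_i\psi - \psi\nabla_i\psi^*\right) . $$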

Setting

$$ \mathbf j_i = \frac{i\hbar}{2m_i}\left(\psi\nabla_i\psi^* - \psi^*\nabla_i\psi\right) , \tag{14.12} $$

where $\nabla_i$ is the operator with components $\nabla_i = (\partial/\partial x_i, \partial/\partial y_i, \partial/\partial z_i)$, we can thus write the preceding relation as

$$ \frac{\partial w}{\partial t} + \sum_{i=1}^{N} {\rm div}_i\,\mathbf j_i(\mathbf r_1, \ldots, \mathbf r_i, \ldots, \mathbf r_N, t) = 0 . \tag{14.13} $$

This equation shows that the change in the configuration probability $w$ is determined by the current of that probability. Hence, $\mathbf j_i$ is a function of the coordinates of all particles (and of time) and represents the current density caused by the motion of particle $i$ if the coordinates of all other $(N - 1)$ particles are fixed. To obtain the current density of the $i$th particle with the other particles in arbitrary positions, (14.12) has to be integrated over all coordinates except those of particle $i$, i.e.

$$ \mathbf J_i(x_i, y_i, z_i, t) = \int \mathbf j_i(x_1, \ldots, z_N, t)\,d\Omega_i . \tag{14.14} $$


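This current density also satisfies the equation of continuity, but now in three-dimensional space: if we integrate (14.13) over $d\Omega_i$, the first term gives

$$ \frac{\partial}{\partial t}\int w(x_1, \ldots, z_N, t)\,d\Omega_i = \frac{\partial}{\partial t}w(x_i, y_i, z_i, t) . $$

Moreover,

$$ \sum_{k=1}^{N}\int {\rm div}_k\,\mathbf j_k\,d\Omega_i = \int {\rm div}_i\,\mathbf j_i\,d\Omega_i + \sum_{k\neq i}\int {\rm div}_k\,\mathbf j_k\,d\Omega_i . $$

The volume element $d\Omega_i$ [see (14.7)] contains the coordinates of all particles with the exception of particle $i$. The integrals of the form $\int {\rm div}_k\,\mathbf j_k\,d\Omega_i$ with $k \neq i$ can be transformed into surface integrals and they are, if $\psi$ vanishes at infinity, equal to zero. In the integral $\int {\rm div}_i\,\mathbf j_i\,d\Omega_i$, we are differentiating and integrating with respect to different variables. Therefore we have

$$ \int {\rm div}_i\,\mathbf j_i\,d\Omega_i = {\rm div}_i\int \mathbf j_i\,d\Omega_i = {\rm div}_i\,\mathbf J_i(x_i, y_i, z_i, t) , $$

where (14.14) has been used. We thus obtain the continuity equation for each individual particle:

$$ \frac{\partial w(x_i, y_i, z_i, t)}{\partial t} + {\rm div}_i\,\mathbf J_i(x_i, y_i, z_i, t) = 0 \tag{14.15} $$

in three-dimensional space $(x_i, y_i, z_i)$.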


14.1 The Conservation of the Total Momentum 
of a Particle System 


In classical mechanics, the total momentum of a particle system that is subject only to internal forces remains constant. The centre of mass then moves in a straight line with constant velocity according to Newton's law. But if there are external forces, then the variation of the total momentum per unit time is equal to the sum of all forces acting on the particles of the system. We will show that these principles of classical mechanics also retain their validity in the domain of quantum phenomena. For this purpose we introduce an operator of total momentum for all the particles of the system. Naturally, by operator of total momentum of the entire particle system we mean the sum over the individual momentum operators $\hat{\mathbf p}_k$ of all particles $k = 1, 2, \ldots, N$:

$$ \hat{\mathbf p} = \sum_{k=1}^{N}\hat{\mathbf p}_k = -i\hbar\sum_{k=1}^{N}\nabla_k . \tag{14.16} $$




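Let us calculate the time derivative of the momentum operator $\hat{\mathbf p}$. According to the general formula of quantum mechanics (see Chap. 8), it is

$$ \frac{d\hat{\mathbf p}}{dt} = \frac{i}{\hbar}[\hat H, \hat{\mathbf p}] = \frac{i}{\hbar}\left(\hat H\hat{\mathbf p} - \hat{\mathbf p}\hat H\right) . \tag{14.17} $$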


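Inserting $\hat H$ from (14.11) and noting that $\hat{\mathbf p}$ commutes with the operator of the kinetic energy

$$ \hat T = -\frac{\hbar^2}{2}\sum_{k=1}^{N}\frac{1}{m_k}\nabla_k^2 , $$

we thus get

$$ \frac{d\hat{\mathbf p}}{dt} = \left(\sum_{k=1}^{N}V_k + \sum_{k\neq j=1}^{N}V_{kj}\right)\left(\sum_{i=1}^{N}\nabla_i\right) - \left(\sum_{i=1}^{N}\nabla_i\right)\left(\sum_{k=1}^{N}V_k + \sum_{k\neq j=1}^{N}V_{kj}\right) . \tag{14.18} $$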

Furthermore,

$$ V_k\left(\sum_{i}\nabla_i\right) - \left(\sum_{i}\nabla_i\right)V_k = -\nabla_k V_k(x_k, y_k, z_k) , \tag{14.19} $$

because $V_k(\mathbf r_k)$ depends only on the coordinates $\mathbf r_k$ of particle $k$.

Last of all, we calculate the commutator of the operator $\sum_i \nabla_i$ with the interaction energy of the particles $\sum_{k\neq j}V_{kj}$. We assume that the forces between the particles depend only on the distances $r_{kj}$ between the particles, so that $V_{kj} = V_{kj}(r_{kj})$. Then only those operators $\nabla_i$ of the sum $\sum_i \nabla_i$ act on $V_{kj}$ for which $i = k$ or $i = j$; i.e. the pair $\nabla_k + \nabla_j$ acts on $V_{kj}$.

Therefore we only examine

$$ V_{kj}(\nabla_k + \nabla_j) - (\nabla_k + \nabla_j)V_{kj} = -\nabla_k V_{kj} - \nabla_j V_{kj} . \tag{14.20} $$
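But now

$$ \nabla_k V_{kj} = \frac{\partial V_{kj}}{\partial r_{kj}}\nabla_k r_{kj} = -\frac{\partial V_{kj}}{\partial r_{kj}}\nabla_j r_{kj} = -\nabla_j V_{kj} . $$

Consequently, we have

$$ \nabla_k V_{kj} + \nabla_j V_{kj} = 0 . \tag{14.21} $$

This is simply Newton's law, according to which action = $-$ reaction. From this it follows that the commutator (14.20) is identical to zero. We thus get

$$ \frac{d\hat{\mathbf p}}{dt} = -\sum_{k=1}^{N}\nabla_k V_k(\mathbf r_k, t) , \tag{14.22} $$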
i.e. the time derivative of the total momentum is equal to the operator of the resulting force which acts on the system through external fields. This law is analogous to the classical law of momentum conservation. The only difference lies in the fact that in quantum mechanics it is not formulated for the actual mechanical quantities but for the operators representing these quantities, and therefore for the mean values of these quantities (see Ehrenfest's theorem in Chap. 8). If there are no external forces ($V_k = 0$), it follows from (14.22) that

$$ \frac{d\hat{\mathbf p}}{dt} = 0 . \tag{14.23} $$

Hence, the total momentum of a system of mutually interacting particles is conserved in the absence of external forces (14.23).


We recall what the operator equation (14.23) means: (1) The mean value of the total momentum does not change with time. (2) The probabilities $w(p)$ of finding a certain value of $p$ remain unchanged too.
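
The key step of this section, relation (14.21), holds for any interaction that depends only on the distance between the two particles. The following sketch checks it by central differences for a screened Coulomb-like potential; that potential and the particle positions are assumptions chosen purely for illustration.

```python
import math


def pair_potential(r):
    """Hypothetical central interaction V_kj(r_kj); any smooth function of r would do."""
    return math.exp(-r) / r


def distance(rk, rj):
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(rk, rj)))


def gradient(which, rk, rj, h=1e-6):
    """Central-difference gradient of V_kj with respect to particle 'k' or 'j'."""
    grad = []
    for axis in range(3):
        plus_k, minus_k = list(rk), list(rk)
        plus_j, minus_j = list(rj), list(rj)
        if which == "k":
            plus_k[axis] += h
            minus_k[axis] -= h
        else:
            plus_j[axis] += h
            minus_j[axis] -= h
        dv = (pair_potential(distance(plus_k, plus_j))
              - pair_potential(distance(minus_k, minus_j)))
        grad.append(dv / (2 * h))
    return grad


def net_internal_force(rk, rj):
    """-(grad_k + grad_j) V_kj; vanishes by (14.21), i.e. action = -reaction."""
    gk = gradient("k", rk, rj)
    gj = gradient("j", rk, rj)
    return [-(a + b) for a, b in zip(gk, gj)]
```

Each single gradient is nonzero, but their sum vanishes, which is why the interaction terms drop out of (14.22).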


14.2 Centre-of-Mass Motion of a System of Particles 
in Quantum Mechanics 


In the following, we show that the centre-of-mass motion of a system of particles does not depend on the relative motion of its constituents. This fact is well known in classical mechanics and is also valid in quantum mechanics.

We consider the Hamiltonian $\hat H$, which only takes into account the influence of the inner forces (two-body forces $W_{ik}(r_{ik})$):

$$ \hat H = -\frac{\hbar^2}{2}\hat T + \hat W , \tag{14.24} $$

where

$$ \hat T = \sum_{k=1}^{N}\frac{1}{m_k}\nabla_k^2 , \qquad \hat W = \sum_{\substack{i,k=1 \\ i\neq k}}^{N} W_{ik}(r_{ik}) . \tag{14.25} $$

We express the Hamiltonian in terms of an adequate coordinate system, consisting of the centre-of-mass coordinates $X, Y, Z$ and the $3N - 3$ relative coordinates. The Jacobi coordinates, already introduced in Example 9.6, suit this purpose. As we recall, they are defined by

$$ \xi_1 = \frac{m_1 x_1}{m_1} - x_2 = x_1 - x_2 , $$
$$ \xi_2 = \frac{m_1 x_1 + m_2 x_2}{m_1 + m_2} - x_3 , $$
$$ \vdots $$
$$ \xi_j = \frac{\sum_{k=1}^{j} m_k x_k}{\sum_{k=1}^{j} m_k} - x_{j+1} , $$
$$ \vdots $$
$$ \xi_N = \frac{\sum_{k=1}^{N} m_k x_k}{M} \equiv X , \tag{14.26} $$


where $M = \sum_{k=1}^{N} m_k$ denotes the total mass of the system. Similar expressions can be obtained for the $y$ and $z$ axes:

$$ \eta_j = \frac{\sum_{k=1}^{j} m_k y_k}{\sum_{k=1}^{j} m_k} - y_{j+1} , \qquad \eta_N \equiv Y , \tag{14.27} $$

$$ \zeta_j = \frac{\sum_{k=1}^{j} m_k z_k}{\sum_{k=1}^{j} m_k} - z_{j+1} , \qquad \zeta_N \equiv Z . \tag{14.28} $$

These are generalizations of the relations between the centre of mass and relative coordinates of the two-body system. The important point is the principle of construction: the Jacobi vector $\boldsymbol\xi_j = \{\xi_j, \eta_j, \zeta_j\}$ is the vector from the centre of mass of the first $j$ particles to the $(j + 1)$-th particle. Figure 14.1 illustrates the situation.


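In these coordinates the operator $\hat T$ of (14.25) takes the form

$$ \hat T = \frac{1}{M}\nabla_N^2 + \sum_{j=1}^{N-1}\frac{1}{\mu_j}\nabla_j^2 , \quad \text{where} \tag{14.29} $$

$$ \nabla_N^2 = \frac{\partial^2}{\partial\xi_N^2} + \frac{\partial^2}{\partial\eta_N^2} + \frac{\partial^2}{\partial\zeta_N^2} = \frac{\partial^2}{\partial X^2} + \frac{\partial^2}{\partial Y^2} + \frac{\partial^2}{\partial Z^2} \tag{14.30} $$

denotes the Laplace operator of the centre of mass of all particles, and

$$ \nabla_j^2 = \frac{\partial^2}{\partial\xi_j^2} + \frac{\partial^2}{\partial\eta_j^2} + \frac{\partial^2}{\partial\zeta_j^2} \tag{14.31} $$

the Laplace operator of the Jacobi coordinates $\boldsymbol\xi_j = \{\xi_j, \eta_j, \zeta_j\}$. The reduced mass $\mu_j$ is given by

$$ \frac{1}{\mu_j} = \frac{1}{\sum_{k=1}^{j} m_k} + \frac{1}{m_{j+1}} , \tag{14.32} $$

where $m_1, \ldots, m_N$ are the masses of the $N$ particles.

Fig. 14.1. The Jacobi coordinates $\boldsymbol\xi_j$ in the case of the vector $\boldsymbol\xi_5$. It points from the centre of mass $\mathbf R_5$ of the first 5 particles to the position vector $\mathbf r_6$ of the 6th particle ($M_5 = m_1 + m_2 + m_3 + m_4 + m_5$)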
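The Hamiltonian of (14.24) can be rewritten in the form

$$ \hat H = -\frac{\hbar^2}{2M}\nabla_N^2 - \sum_{j=1}^{N-1}\frac{\hbar^2}{2\mu_j}\nabla_j^2 + \hat W(\xi_1, \eta_1, \zeta_1, \ldots, \xi_{N-1}, \eta_{N-1}, \zeta_{N-1}) . \tag{14.33} $$

Using (14.30) it follows that

$$ \hat T_s = -\frac{\hbar^2}{2M}\nabla_N^2 = -\frac{\hbar^2}{2M}\left(\frac{\partial^2}{\partial X^2} + \frac{\partial^2}{\partial Y^2} + \frac{\partial^2}{\partial Z^2}\right) \tag{14.34} $$

represents the kinetic-energy operator of the centre of mass of all particles. The kinetic-energy operator of the relative (inner) particle motion is given by

$$ \hat T_r = -\sum_{j=1}^{N-1}\frac{\hbar^2}{2\mu_j}\nabla_j^2 . \tag{14.35} $$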
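
That the kinetic energy separates into a centre-of-mass part and $N - 1$ relative parts with reduced masses can be checked with exact rational arithmetic. The sketch below is an illustration only, assuming the standard chain rule $\partial/\partial x_k = \sum_a (\partial\xi_a/\partial x_k)\,\partial/\partial\xi_a$ for one Cartesian direction; the function names are chosen here and do not come from the text. It builds the matrix $A_{ak} = \partial\xi_a/\partial x_k$ of the Jacobi transformation and verifies that $\sum_k A_{ak}A_{bk}/m_k$ is diagonal, with entries $1/\mu_j$ and $1/M$.

```python
from fractions import Fraction


def jacobi_matrix(masses):
    """Rows: xi_1 .. xi_{N-1}, then the centre of mass; columns: particles 1..N."""
    n = len(masses)
    rows = []
    for j in range(1, n):                       # xi_j for j = 1 .. N-1
        m_first_j = sum(masses[:j])             # mass of the first j particles
        row = ([Fraction(m, m_first_j) for m in masses[:j]]
               + [Fraction(-1)]
               + [Fraction(0)] * (n - j - 1))
        rows.append(row)
    total = sum(masses)
    rows.append([Fraction(m, total) for m in masses])   # xi_N = X
    return rows


def inverse_mass_matrix(masses):
    """G[a][b] = sum_k A[a][k] A[b][k] / m_k, the coefficients of the new Laplacians."""
    a = jacobi_matrix(masses)
    n = len(masses)
    return [[sum(a[r][k] * a[c][k] / masses[k] for k in range(n))
             for c in range(n)] for r in range(n)]


def reduced_masses(masses):
    """mu_j from 1/mu_j = 1/(m_1 + ... + m_j) + 1/m_{j+1}."""
    return [1 / (Fraction(1, sum(masses[:j])) + Fraction(1, masses[j]))
            for j in range(1, len(masses))]
```

All mixed derivative terms vanish exactly, so no cross terms between the Jacobi coordinates or between them and the centre of mass appear.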


Looking at (14.33), we see that the interaction energy does not depend on the centre-of-mass coordinates $\boldsymbol\xi_N = \{\xi_N, \eta_N, \zeta_N\} = \{X, Y, Z\}$. It depends, according to (14.25), on the relative distances between the particles only. But the relative coordinates can be expressed by the first $(N - 1)$ Jacobi coordinates $\boldsymbol\xi_1, \ldots, \boldsymbol\xi_{N-1}$, which follows immediately from the relations (14.26). If we transform $\boldsymbol\xi_1, \ldots, \boldsymbol\xi_{N-1}$, i.e. $(\xi_1, \eta_1, \zeta_1, \ldots, \xi_{N-1}, \eta_{N-1}, \zeta_{N-1})$, by a transformation to arbitrary relative coordinates $q_1, q_2, \ldots, q_{3N-3}$, the kinetic-energy operator of the centre of mass (14.34) remains unchanged. Therefore we can generalize (14.33) to

$$ \hat H = -\frac{\hbar^2}{2M}\nabla_N^2 + \hat H_R(q_1, q_2, \ldots, q_{3N-3}) . \tag{14.36} $$


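The Hamiltonian of the relative motion $\hat H_R$ does not depend on the centre-of-mass coordinates; thus the wave function of the system separates into a relative part and a centre-of-mass part. In the next step we introduce the operator of the total momentum,

$$ \hat P_x = -i\hbar\frac{\partial}{\partial X} , \qquad \hat P_y = -i\hbar\frac{\partial}{\partial Y} , \qquad \hat P_z = -i\hbar\frac{\partial}{\partial Z} , \tag{14.37} $$

so that we can write the kinetic energy of the centre of mass in the form

$$ \frac{\hat{\mathbf P}^2}{2M} = \frac{\hat P_x^2 + \hat P_y^2 + \hat P_z^2}{2M} = -\frac{\hbar^2}{2M}\left(\frac{\partial^2}{\partial X^2} + \frac{\partial^2}{\partial Y^2} + \frac{\partial^2}{\partial Z^2}\right) = -\frac{\hbar^2}{2M}\nabla_N^2 . \tag{14.38} $$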
The wave function of the system separates according to (14.36), and we write it as a product of the centre-of-mass part $\Phi(X, Y, Z, t)$, with the centre-of-mass coordinates $X, Y, Z$, and the relative part $\psi(q_1, q_2, \ldots, q_{3N-3}, t)$. Thus we have

$$ \Psi(X, Y, Z, q_1, q_2, \ldots, q_{3N-3}, t) = \Phi(X, Y, Z, t)\,\psi(q_1, q_2, \ldots, q_{3N-3}, t) . \tag{14.39} $$


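If we insert (14.39) into the Schrödinger equation, we obtain

$$ i\hbar\frac{\partial}{\partial t}\left[\Phi(X, Y, Z, t)\,\psi(q_1, \ldots, q_{3N-3}, t)\right] = \left(-\frac{\hbar^2}{2M}\nabla_N^2 + \hat H_R(q_1, \ldots, q_{3N-3})\right)\Phi(X, Y, Z, t)\,\psi(q_1, \ldots, q_{3N-3}, t) , $$

i.e.

$$ i\hbar\,\psi\frac{\partial\Phi}{\partial t} + i\hbar\,\Phi\frac{\partial\psi}{\partial t} = -\frac{\hbar^2}{2M}\psi\nabla_N^2\Phi + \Phi\hat H_R\psi . \tag{14.40} $$

Dividing (14.40) by $\Phi\psi$ (for $\Phi\psi \neq 0$) gives

$$ \frac{1}{\Phi}\,i\hbar\frac{\partial\Phi}{\partial t} + \frac{1}{\psi}\,i\hbar\frac{\partial\psi}{\partial t} = -\frac{1}{\Phi}\frac{\hbar^2}{2M}\nabla_N^2\Phi + \frac{1}{\psi}\hat H_R\psi . $$

After reordering, we get

$$ \left(i\hbar\frac{\partial\Phi}{\partial t} + \frac{\hbar^2}{2M}\nabla_N^2\Phi\right)\frac{1}{\Phi} = \left(-i\hbar\frac{\partial\psi}{\partial t} + \hat H_R\psi\right)\frac{1}{\psi} = E . $$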

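This equation is fulfilled if both sides are equal to a constant $E$. Then we have for $\Phi(X, Y, Z, t)$ and $\psi(q_1, q_2, \ldots, q_{3N-3}, t)$

$$ i\hbar\frac{\partial\Phi}{\partial t} = -\frac{\hbar^2}{2M}\nabla_N^2\Phi + E\Phi , \qquad i\hbar\frac{\partial\psi}{\partial t} = \hat H_R\psi - E\psi . \tag{14.42} $$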

The first equation describes the centre-of-mass motion of a system of particles with total mass $M$. If no external forces act on the system, the centre of mass moves like a free particle of mass $M$. The simplest special solution is given by a plane wave (a de Broglie wave), i.e.

$$ \Phi(X, Y, Z, t) = (2\pi\hbar)^{-3/2}\exp\left[\frac{i}{\hbar}\left(P_x X + P_y Y + P_z Z - E_s t\right)\right] = (2\pi\hbar)^{-3/2}\exp\left[\frac{i}{\hbar}\left(\mathbf P\cdot\mathbf X - E_s t\right)\right] . \tag{14.43} $$


By inserting $\Phi$ in the Schrödinger equation (14.42), we can identify the components of $\mathbf P = (P_x, P_y, P_z)$ as the eigenvalues of the total-momentum operator. For the eigenvalue $E_s$ of the kinetic energy of the centre-of-mass motion, it follows that

$$ E_s = \frac{1}{2M}\left(P_x^2 + P_y^2 + P_z^2\right) + E . \tag{14.44} $$

The additive constant $E$ is unimportant and can be chosen equal to zero ($E = 0$). The wavelength of the de Broglie wave is given by

$$ \lambda = \frac{2\pi\hbar}{P} = \frac{2\pi\hbar}{MV} , \qquad P = \sqrt{P_x^2 + P_y^2 + P_z^2} , \tag{14.45} $$

$V$ being the velocity of the centre of mass.

We can now deduce from (14.43-45) that de Broglie waves (14.43) are not oscillations connected with the internal structure of the particle system, but represent the general quantum-mechanical motion of free particles (or, in our case, a centre-of-mass motion, i.e. the motion of the system as a whole) without external forces.
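
As a small numerical illustration of (14.44) and (14.45), the following sketch evaluates the centre-of-mass energy and the de Broglie wavelength of a composite system. It assumes SI units and the rounded SI value of $\hbar$ given in the code; the function names are hypothetical and the argument values are up to the reader.

```python
import math

HBAR = 1.054571817e-34  # reduced Planck constant in J s (rounded SI value)


def de_broglie_wavelength(total_mass, cm_speed):
    """lambda = 2*pi*hbar / (M*V) for the centre-of-mass motion, cf. (14.45)."""
    momentum = total_mass * cm_speed
    return 2.0 * math.pi * HBAR / momentum


def cm_kinetic_energy(px, py, pz, total_mass, offset=0.0):
    """E_s = (Px^2 + Py^2 + Pz^2) / (2M) + E, cf. (14.44); offset plays the role of E."""
    return (px * px + py * py + pz * pz) / (2.0 * total_mass) + offset
```

Only the total mass and the centre-of-mass velocity enter; the internal structure of the system plays no role for the wavelength.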

The essential and interesting aspects of the many-body problem concern the 
inner degrees of freedom, described by (14.42). The centre-of-mass motion is, 
as in classical mechanics, a rather trivial aspect. It is only important if all the 
particles of the system — which, as a result of inner binding forces, are confined 
relative to each other — are deflected in an external field, or interact with other 
complex systems. In the latter case we speak of cluster structure and mean the 
splitting of an N-body system into various substructures.

Cluster structure plays an important role in the breakup of a nucleus with $A$ nucleons into two fragments with nucleon numbers $A_1$ and $A_2$ ($A_1 + A_2 = A$) or into three or more fragments. This is called two-body (binary) or three-body (ternary) etc. fission. If one of the fragments is very big and the other one quite small (e.g. $A \to (A - 4) + 4$ or $A \to (A - 12) + 12$ and $A \approx 220$), one speaks of radioactive decay. The most famous form of this is $\alpha$ decay, in which an $\alpha$ particle ($^4$He nucleus) is emitted.

More recently, so-called cluster radioactivity has been discovered, in which 
'2C nuclei, 1°O nuclei, 7+Ne nuclei, 77S nuclei etc. are emitted. It was theoret- 
ically predicted’ and 4 years later, experimentally confirmed.* 

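Returning to our calculations, we finally get for (14.39), after separation of the centre-of-mass motion (14.43), the general form

$$ \Psi(X, Y, Z, q_1, \ldots, q_{3N-3}, t) = (2\pi\hbar)^{-3/2}\exp\left[\frac{i}{\hbar}\left(\mathbf P\cdot\mathbf X - E_s t\right)\right]\psi(q_1, \ldots, q_{3N-3}, t) . \tag{14.46} $$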


14.3 Conservation of Total Angular Momentum 
in a Quantum-Mechanical Many-Particle System 


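Again we consider a system of $N$ particles and denote the components of the orbital angular momentum of particle $k$ in terms of Cartesian coordinates by $(\hat l_x^k, \hat l_y^k, \hat l_z^k)$. The position vector of the $k$th particle is $\mathbf x_k = (x_k, y_k, z_k)$. We then have

$$ \hat l_x^k = -i\hbar\left(y_k\frac{\partial}{\partial z_k} - z_k\frac{\partial}{\partial y_k}\right) , \quad \hat l_y^k = -i\hbar\left(z_k\frac{\partial}{\partial x_k} - x_k\frac{\partial}{\partial z_k}\right) , \quad \hat l_z^k = -i\hbar\left(x_k\frac{\partial}{\partial y_k} - y_k\frac{\partial}{\partial x_k}\right) . \tag{14.47} $$

1 A. Sandulescu, D.N. Poenaru, W. Greiner: Sov. J. Part. Nucl. 11, 528-541 (1980).
2 H.J. Rose, G.A. Jones: Nature 307, 245-247 (1984).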

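The components of the operator for the total orbital angular momentum $\hat{\mathbf l} = (\hat l_x, \hat l_y, \hat l_z)$ of the system are defined as the sum over the individual angular momenta, i.e.

$$ \hat l_x = \sum_{k=1}^{N}\hat l_x^k , \qquad \hat l_y = \sum_{k=1}^{N}\hat l_y^k , \qquad \hat l_z = \sum_{k=1}^{N}\hat l_z^k . \tag{14.48} $$

In the following we will prove that the time derivative of the angular-momentum operator equals the operator of the torque exerted on the system. According to (8.6), the time derivative of a not explicitly time-dependent operator, e.g. $\hat l_x$, is

$$ \frac{d\hat l_x}{dt} = \frac{i}{\hbar}[\hat H, \hat l_x] . \tag{14.49} $$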


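The Hamiltonian of the $N$-particle system with masses $m_1, m_2, \ldots, m_N$ reads:

$$ \hat H = \sum_{k=1}^{N}\left(\frac{\hat{\mathbf p}_k^2}{2m_k} + V_k(x_k, y_k, z_k)\right) + \sum_{k\neq j} V_{kj}(x_k, y_k, z_k, x_j, y_j, z_j) . \tag{14.50} $$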

As before, $V_k$ corresponds to the potential energy of the $k$th particle in an external field, and $V_{kj}$ is the interaction energy between particles $k$ and $j$. We know from Chap. 4 (see section on angular-momentum operators) that every single component of the angular-momentum operator commutes with its square. Because the angular-momentum operators of different particles act in different coordinate spaces, they commute and do not affect each other, e.g. $[\hat l_x^k, \hat l_y^j] = 0$ for any $k \neq j$. Each component $\hat l_\xi^k$ of a particle's angular-momentum operator commutes with the square $(\hat{\mathbf l}^k)^2$ of that particle's angular-momentum operator, i.e.

$$ [\hat l_x^k, (\hat{\mathbf l}^k)^2] = [\hat l_y^k, (\hat{\mathbf l}^k)^2] = [\hat l_z^k, (\hat{\mathbf l}^k)^2] = 0 . \tag{14.51} $$


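We also know that $\hat{\mathbf p}_k^2$ commutes with $\hat l_x^k, \hat l_y^k, \hat l_z^k$, which can be verified, for example, for the $x$ component:

$$ \hat{\mathbf p}_k^2 = -\hbar^2\nabla_k^2 = -\hbar^2\left(\frac{\partial^2}{\partial x_k^2} + \frac{\partial^2}{\partial y_k^2} + \frac{\partial^2}{\partial z_k^2}\right) , $$

and therefore

$$ [\nabla_k^2, \hat l_x^k] = -i\hbar\left[\left(\frac{\partial^2}{\partial x_k^2} + \frac{\partial^2}{\partial y_k^2} + \frac{\partial^2}{\partial z_k^2}\right)\left(y_k\frac{\partial}{\partial z_k} - z_k\frac{\partial}{\partial y_k}\right) - \left(y_k\frac{\partial}{\partial z_k} - z_k\frac{\partial}{\partial y_k}\right)\left(\frac{\partial^2}{\partial x_k^2} + \frac{\partial^2}{\partial y_k^2} + \frac{\partial^2}{\partial z_k^2}\right)\right] $$
$$ = -i\hbar\left[\frac{\partial^2}{\partial y_k^2}\,y_k\frac{\partial}{\partial z_k} - y_k\frac{\partial}{\partial z_k}\frac{\partial^2}{\partial y_k^2} - \frac{\partial^2}{\partial z_k^2}\,z_k\frac{\partial}{\partial y_k} + z_k\frac{\partial}{\partial y_k}\frac{\partial^2}{\partial z_k^2}\right] $$
$$ = -i\hbar\left[2\frac{\partial^2}{\partial y_k\,\partial z_k} - 2\frac{\partial^2}{\partial z_k\,\partial y_k}\right] = 0 , $$

i.e.

$$ [\nabla_k^2, \hat l_x^k] = 0 . \tag{14.52} $$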


Let us now split the kinetic-energy operator in (14.50) into a translational part $\hat T_{r_k}$ along the radius vector $\mathbf r_k$ and a rotational part (see Fig. 14.2 and also the section on kinetic energy in Chap. 4):

$$ -\frac{\hbar^2}{2m_k}\nabla_k^2 = \hat T_{r_k} + \frac{(\hat{\mathbf l}^k)^2}{2m_k r_k^2} . \tag{14.53} $$
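Because each component of the angular-momentum operator of a particle commutes with $(\hat{\mathbf l}^k)^2$ and with $\nabla_k^2$, it also commutes with $\hat T_{r_k}$ according to (14.53):

$$ [\hat T_{r_k}, \hat l_\xi^k] = 0 \quad (\xi = x, y, z) . \tag{14.54} $$

In order to evaluate (14.49), we need the commutators $[V_k, \hat l_\xi^k]$ and $[V_{kj}, \hat l_\xi^k]$ ($\xi = x, y, z$):

$$ [V_k, \hat l_x^k] = -i\hbar\left[V_k\left(y_k\frac{\partial}{\partial z_k} - z_k\frac{\partial}{\partial y_k}\right) - \left(y_k\frac{\partial}{\partial z_k} - z_k\frac{\partial}{\partial y_k}\right)V_k\right] = i\hbar\left(y_k\frac{\partial V_k}{\partial z_k} - z_k\frac{\partial V_k}{\partial y_k}\right) , \tag{14.55} $$

and, analogously, we calculate:

$$ [V_{kj}, \hat l_x^k] = i\hbar\left(y_k\frac{\partial V_{kj}}{\partial z_k} - z_k\frac{\partial V_{kj}}{\partial y_k}\right) . \tag{14.56} $$

Fig. 14.2. The position vector $\mathbf r_k$, its angular momentum $\mathbf l_k$ and the external force $\mathbf f_k = -\nabla_k V_k(\mathbf r_k)$ acting on the $k$th particle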
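We express the derivatives in (14.56) in terms of the relative coordinate $r_{kj}$ and have (see Fig. 14.3), with $\mathbf r_{kj} = \mathbf r_k - \mathbf r_j$,

$$ \frac{\partial}{\partial x_k} = \frac{x_k - x_j}{r_{kj}}\frac{\partial}{\partial r_{kj}} , \qquad \frac{\partial}{\partial y_k} = \frac{y_k - y_j}{r_{kj}}\frac{\partial}{\partial r_{kj}} , \qquad \frac{\partial}{\partial z_k} = \frac{z_k - z_j}{r_{kj}}\frac{\partial}{\partial r_{kj}} . $$

Similarly, we obtain for the derivatives with respect to the coordinates of the $j$th particle

$$ \frac{\partial}{\partial x_j} = -\frac{x_k - x_j}{r_{kj}}\frac{\partial}{\partial r_{kj}} , \qquad \frac{\partial}{\partial y_j} = -\frac{y_k - y_j}{r_{kj}}\frac{\partial}{\partial r_{kj}} , \qquad \frac{\partial}{\partial z_j} = -\frac{z_k - z_j}{r_{kj}}\frac{\partial}{\partial r_{kj}} . $$

According to the above relations, (14.56) reads:

$$ [V_{kj}, \hat l_x^k] = i\hbar\frac{\partial V_{kj}}{\partial r_{kj}}\left(y_k\frac{z_k - z_j}{r_{kj}} - z_k\frac{y_k - y_j}{r_{kj}}\right) = i\hbar\,(z_k y_j - y_k z_j)\frac{1}{r_{kj}}\frac{\partial V_{kj}}{\partial r_{kj}} . \tag{14.57} $$

Fig. 14.3. The relative vector $\mathbf r_{kj}$ between the $k$th and the $j$th particle is defined as $\mathbf r_{kj} = \mathbf r_k - \mathbf r_j$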


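Using the relations (14.51-53, 56 and 57), the time derivative of a component of the total angular momentum can be evaluated. According to (14.48), we obtain for the $x$ component

$$ \frac{d\hat l_x}{dt} = -\sum_{k=1}^{N}\left(y_k\frac{\partial V_k}{\partial z_k} - z_k\frac{\partial V_k}{\partial y_k}\right) - \sum_{\substack{j,k=1 \\ j\neq k}}^{N}(z_k y_j - y_k z_j)\frac{1}{r_{kj}}\frac{\partial V_{kj}}{\partial r_{kj}} . \tag{14.58} $$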


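The second part in (14.58) vanishes, since the terms of the sum change their sign when the indices $k$ and $j$ are interchanged, and thus cancel each other. The result is

$$ \frac{d\hat l_x}{dt} = -\sum_{k=1}^{N}\left(y_k\frac{\partial V_k}{\partial z_k} - z_k\frac{\partial V_k}{\partial y_k}\right) . \tag{14.59} $$

Similarly, we obtain for the other two components

$$ \frac{d\hat l_y}{dt} = -\sum_{k=1}^{N}\left(z_k\frac{\partial V_k}{\partial x_k} - x_k\frac{\partial V_k}{\partial z_k}\right) , \tag{14.60} $$

$$ \frac{d\hat l_z}{dt} = -\sum_{k=1}^{N}\left(x_k\frac{\partial V_k}{\partial y_k} - y_k\frac{\partial V_k}{\partial x_k}\right) . \tag{14.61} $$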


Thus we have proved the theorem, already known from mechanics, that the time derivative of the orbital angular momentum equals the torque of the external forces acting upon the system. If no external forces are present, or if their torque vanishes, the total angular momentum is conserved:

$$ \frac{d\hat l_x}{dt} = \frac{d\hat l_y}{dt} = \frac{d\hat l_z}{dt} = 0 . \tag{14.62} $$


Thus for the case of vanishing external forces, the mean values of $\hat l_x$, $\hat l_y$ and $\hat l_z$ are constant, as are the probabilities $w(l_x)$, $w(l_y)$ and $w(l_z)$ of finding a fixed value for an angular-momentum component.

Including the internal spin $\hat{\mathbf s}$ of a particle (see Chap. 12), the relations for the total angular momentum can be straightforwardly modified as follows:

$$ \hat j_x = \sum_{k=1}^{N}\left(\hat l_x^k + \hat s_x^k\right) , \qquad \hat j_y = \sum_{k=1}^{N}\left(\hat l_y^k + \hat s_y^k\right) , \qquad \hat j_z = \sum_{k=1}^{N}\left(\hat l_z^k + \hat s_z^k\right) , \tag{14.63} $$

where $\hat s_x^k, \hat s_y^k, \hat s_z^k$ denote the projections of the spin of the $k$th particle on the corresponding coordinate axis. The spin operators are represented by $2\times 2$ Pauli matrices. If no external electromagnetic fields are present, i.e. if no forces are acting on the spin, the conservation law for the angular momentum remains unrestrictedly valid, since in this case the Hamiltonian commutes with each component of $\hat{\mathbf s}^k$.

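The commutation relations of the total angular momentum of a system of particles correspond to those of the orbital angular momentum, because the operators $\hat l_\xi^k$, $\hat l_\eta^j$, $\hat s_\xi^k$ and $\hat s_\eta^j$ commute for different particle indices, and, furthermore, $\hat l_\xi^k$ and $\hat s_\eta^k$ commute with each other, because these operators act in different spaces (coordinate space, spin space). Hence,

$$ [\hat j_x, \hat j_y] = i\hbar\hat j_z , \qquad [\hat j_y, \hat j_z] = i\hbar\hat j_x , \qquad [\hat j_z, \hat j_x] = i\hbar\hat j_y , \tag{14.64} $$
$$ [\hat{\mathbf j}^2, \hat j_x] = [\hat{\mathbf j}^2, \hat j_y] = [\hat{\mathbf j}^2, \hat j_z] = 0 . \tag{14.65} $$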

The eigenvalues of $\hat j_z^k$ are equal to the sum of the eigenvalues of $\hat l_z^k$ and $\hat s_z^k$. In Chap. 4 (see the section on angular-momentum operators) we found the eigenvalue of the $z$ component of the orbital momentum to be $m_l^k\hbar$, with $-l_k \le m_l^k \le l_k$ ($l_k = 0, 1, 2, \ldots$ represents the quantum number of the orbital momentum of the $k$th particle). The eigenvalue of the spin is $\pm\hbar/2$ [see (12.13, 39 and 40)]; thus we have for the eigenvalues of $\hat j_z^k$ the values $\hbar m^k$, $m^k$ being a half-integer (an odd multiple of $\frac{1}{2}$) for particles with spin $\frac{1}{2}$. For the $z$ component of the total angular momentum, we have

$$ m\hbar = \sum_{k=1}^{N} m^k\hbar = \hbar\sum_{k=1}^{N} m^k . \tag{14.66} $$

Equation (14.66) has to be interpreted as an eigenvalue equation; the index $z$ is omitted. To determine the eigenvalues of $\hat{\mathbf j}^2$, we introduce the eigenfunctions $|jm\rangle$ of $\hat{\mathbf j}^2$ and $\hat j_z$ with

$$ \hat{\mathbf j}^2|jm\rangle = J^2|jm\rangle , \qquad \hat j_z|jm\rangle = m\hbar|jm\rangle . \tag{14.67} $$


Neither $J^2$ nor $m$ nor their relation to each other are yet known. To proceed in this direction, it is customary to consider the step operators $\hat j_+$ and $\hat j_-$, which are defined as

$$ \hat j_+ = \hat j_x + i\hat j_y , \qquad \hat j_- = \hat j_x - i\hat j_y . \tag{14.68} $$


Using the commutation relations (14.64), the validity of the following relations can easily be verified:

$$ [\hat j_+, \hat j_z] = [\hat j_x, \hat j_z] + i[\hat j_y, \hat j_z] = -i\hbar\hat j_y - \hbar\hat j_x = -\hbar\hat j_+ , \tag{14.69a} $$

and, similarly,

$$ [\hat j_-, \hat j_z] = \hbar\hat j_- . \tag{14.69b} $$

We rewrite the commutation relations (14.69a) and (14.69b) in matrix form, i.e. we multiply both equations from the left by a bra, and from the right by a ket vector. We choose a basis in which $\hat j_z$ is diagonal: $\hat j_z|jm\rangle = \hbar m|jm\rangle$. We calculate $\langle jm'|\ldots|jm''\rangle$ and obtain

$$ (\hat j_+)_{m'm''}\,\hbar m'' - \hbar m'\,(\hat j_+)_{m'm''} = -\hbar(\hat j_+)_{m'm''} , $$
$$ (\hat j_-)_{m'm''}\,\hbar m'' - \hbar m'\,(\hat j_-)_{m'm''} = \hbar(\hat j_-)_{m'm''} , \tag{14.70} $$

or

$$ (\hat j_+)_{m'm''}\,(m'' - m' + 1) = 0 , $$
$$ (\hat j_-)_{m'm''}\,(m'' - m' - 1) = 0 . \tag{14.71} $$

Obviously, the only nonvanishing matrix elements of $\hat j_+$ and $\hat j_-$ are given by $(\hat j_+)_{m,m-1}$ and $(\hat j_-)_{m,m+1}$. The operator of the square of the total angular momentum can be written in terms of $\hat j_+\hat j_-$ or $\hat j_-\hat j_+$:


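$$ \hat j_+\hat j_- = \hat j_x^2 + \hat j_y^2 - i[\hat j_x, \hat j_y] = \hat{\mathbf j}^2 - \hat j_z^2 + \hbar\hat j_z , \qquad \hat j_-\hat j_+ = \hat{\mathbf j}^2 - \hat j_z^2 - \hbar\hat j_z . \tag{14.72} $$

Completing the square yields

$$ \hat j_+\hat j_- = \hat{\mathbf j}^2 + \frac{\hbar^2}{4} - \left(\hat j_z - \frac{\hbar}{2}\right)^2 , \qquad \hat j_-\hat j_+ = \hat{\mathbf j}^2 + \frac{\hbar^2}{4} - \left(\hat j_z + \frac{\hbar}{2}\right)^2 . \tag{14.73} $$

Considering the diagonal matrix elements $\langle jm|\ldots|jm\rangle$ yields

$$ (\hat j_+\hat j_-)_{mm} = (\hat j_+)_{m,m-1}(\hat j_-)_{m-1,m} = J^2 + \frac{\hbar^2}{4} - \hbar^2\left(m - \frac{1}{2}\right)^2 \ge 0 , $$
$$ (\hat j_-\hat j_+)_{mm} = (\hat j_-)_{m,m+1}(\hat j_+)_{m+1,m} = J^2 + \frac{\hbar^2}{4} - \hbar^2\left(m + \frac{1}{2}\right)^2 \ge 0 . \tag{14.74} $$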


We assume $J^2$ to be a given, but still unknown, positive semidefinite quantity. In the following we denote by $m'$ the lowest possible value of $m$, and by $m''$ the maximal value of $m$. From (14.74) it follows, by making use of

$$ (\hat j_+)_{m',m'-1} = 0 = (\hat j_-)_{m'-1,m'} \quad \text{and} \quad (\hat j_-)_{m'',m''+1} = 0 = (\hat j_+)_{m''+1,m''} , $$

that

$$ J^2 + \frac{\hbar^2}{4} = \hbar^2\left(m' - \frac{1}{2}\right)^2 , \qquad J^2 + \frac{\hbar^2}{4} = \hbar^2\left(m'' + \frac{1}{2}\right)^2 , $$

and therefore

$$ m' = \frac{1}{2} - \sqrt{\frac{J^2}{\hbar^2} + \frac{1}{4}} , \qquad m'' = -\frac{1}{2} + \sqrt{\frac{J^2}{\hbar^2} + \frac{1}{4}} . \tag{14.75} $$

In the equation for $m'$, we have chosen the negative value of the root in order to get the smallest possible value for $m'$. The difference $m'' - m' + 1$ is an integer, giving the number of possible $z$ projections of the total angular momentum $j$. Setting $m'' - m' + 1 = 2j + 1$ (in analogy to the orbital momentum), it follows from (14.75) that

$$ 2j + 1 = 2\sqrt{\frac{J^2}{\hbar^2} + \frac{1}{4}} . \tag{14.76} $$

Since for the $z$ projection $m$ of the total angular momentum both positive and negative values must be equally represented, $m''$ must be equal to $-m'$. Then, from $m'' - m' + 1 = 2j + 1$, we get

$$ m = -j, -j + 1, \ldots, +j . \tag{14.77} $$


Thus the relation for the total angular momentum has been proved to be of the eigenvalue form

$$ \hat{\mathbf j}^2|jm\rangle = \hbar^2 j(j + 1)|jm\rangle , \tag{14.78} $$

$$ \hat j_z|jm\rangle = \hbar m|jm\rangle . \tag{14.79} $$


Depending on the number of particles and on the spin, $j$ has either integral values $0, 1, 2, 3, \ldots$, or half-integral values, i.e. $j = \frac{1}{2}, \frac{3}{2}, \frac{5}{2}, \ldots$. For the projections $m$ of the total angular momentum we obtain $2j + 1$ possible quantum-mechanical orientations with respect to an arbitrary axis (here, the $z$ axis), namely $m = -j, -j + 1, \ldots, j - 1, j$. Electrons have spin $\frac{1}{2}$; therefore, for a system consisting of an even number of electrons, $j$ has integer values, while an odd number leads to half-integral values.
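
The results (14.78) and (14.79) can be illustrated with explicit matrices. The sketch below assumes units with $\hbar = 1$ and the standard matrix elements $\langle m + 1|\hat j_+|m\rangle = \sqrt{j(j+1) - m(m+1)}$, which follow from (14.74) together with $J^2 = j(j+1)$ up to a phase convention. It builds $\hat j_x, \hat j_y, \hat j_z$ for a given $j$ and measures how well they satisfy $[\hat j_x, \hat j_y] = i\hat j_z$ and $\hat{\mathbf j}^2 = j(j+1)$. The function names are illustrative.

```python
import math


def matmul(a, b):
    """Product of two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[r][k] * b[k][c] for k in range(n)) for c in range(n)]
            for r in range(n)]


def angular_momentum_matrices(two_j):
    """Return (jx, jy, jz) for j = two_j / 2 with hbar = 1, basis m = j, j-1, ..., -j."""
    j = two_j / 2.0
    dim = two_j + 1
    ms = [j - i for i in range(dim)]
    jplus = [[0.0] * dim for _ in range(dim)]
    for col in range(1, dim):
        m = ms[col]
        # <m+1| j_+ |m> = sqrt(j(j+1) - m(m+1)); row col-1 belongs to m+1
        jplus[col - 1][col] = math.sqrt(j * (j + 1) - m * (m + 1))
    # j_- is the Hermitian conjugate of j_+ (a plain transpose, since j_+ is real)
    jminus = [[jplus[c][r] for c in range(dim)] for r in range(dim)]
    jx = [[0.5 * (jplus[r][c] + jminus[r][c]) for c in range(dim)] for r in range(dim)]
    jy = [[-0.5j * (jplus[r][c] - jminus[r][c]) for c in range(dim)] for r in range(dim)]
    jz = [[ms[r] if r == c else 0.0 for c in range(dim)] for r in range(dim)]
    return jx, jy, jz


def max_residual(two_j):
    """Largest deviation from [jx, jy] = i jz and from j^2 = j(j+1) times the unit matrix."""
    jx, jy, jz = angular_momentum_matrices(two_j)
    dim = two_j + 1
    j = two_j / 2.0
    xy, yx = matmul(jx, jy), matmul(jy, jx)
    xx, yy, zz = matmul(jx, jx), matmul(jy, jy), matmul(jz, jz)
    worst = 0.0
    for r in range(dim):
        for c in range(dim):
            comm = xy[r][c] - yx[r][c] - 1j * jz[r][c]
            casimir = xx[r][c] + yy[r][c] + zz[r][c] - (j * (j + 1) if r == c else 0.0)
            worst = max(worst, abs(comm), abs(casimir))
    return worst
```

The argument `two_j` is passed as an integer so that integral and half-integral $j$ are treated on the same footing.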

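To prove the eigenvalue equations (14.78) and (14.79) we have made use only of the commutation relations (14.64) and (14.65). Because the total orbital angular momentum $\hat{\mathbf l}$ and the total spin $\hat{\mathbf s}$ satisfy the same commutation rules, we obtain analogous relations for the eigenvalues of the corresponding operators:

$$ \hat{\mathbf l} = \sum_{k=1}^{N}\hat{\mathbf l}_k , \tag{14.80} $$

$$ \hat{\mathbf l}^2:\ \hbar^2 l(l + 1) , \qquad l = 0, 1, 2, \ldots , \tag{14.81} $$

$$ \hat l_z:\ \hbar m_l , \qquad |m_l| \le l , \tag{14.82} $$

$$ \hat{\mathbf s} = \sum_{k=1}^{N}\hat{\mathbf s}_k , \tag{14.83} $$

$$ \hat{\mathbf s}^2:\ \hbar^2 s(s + 1) , \qquad s = 0, \tfrac{1}{2}, 1, \tfrac{3}{2}, \ldots , \tag{14.84} $$

$$ \hat s_z:\ \hbar m_s , \qquad |m_s| \le s . \tag{14.85} $$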


For given values of the total orbital angular momentum $l$ and the total spin $s$, $j$ assumes, depending on the relative orientation of $l$ and $s$, all values between $|l - s|$ (antiparallel orientation) and $l + s$ (parallel orientation):

$$ j = l + s,\ l + s - 1,\ \ldots,\ |l - s| . \tag{14.86} $$
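
The rule (14.86) is easy to enumerate. The sketch below uses exact fractions so that half-integral values are handled without rounding; the function names are illustrative. It lists the allowed $j$ values and counts the states $\sum_j (2j + 1)$, which must equal the number $(2l + 1)(2s + 1)$ of product states.

```python
from fractions import Fraction


def allowed_j(l, s):
    """All j from l + s down to |l - s| in integer steps, cf. (14.86)."""
    l, s = Fraction(l), Fraction(s)
    j = l + s
    values = []
    while j >= abs(l - s):
        values.append(j)
        j -= 1
    return values


def count_states(l, s):
    """Total number of |j m> states, the sum of (2j + 1) over the allowed j."""
    return sum(2 * j + 1 for j in allowed_j(l, s))
```

For example, `allowed_j(1, Fraction(1, 2))` gives the two values $j = \frac{3}{2}$ and $j = \frac{1}{2}$.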


This is physically reasonable and is illustrated in Fig. 14.4.³ States with the same $l$ and $s$ form a group of levels called a multiplet, which lie close together because of the weak $\hat{\mathbf l}\cdot\hat{\mathbf s}$ interaction. From (14.86), it follows that for $s < l$ a multiplet contains $2s + 1$ states. Consequently, the specification of $j$, $l$ and $s$ is essential to characterize the energy of the entire atom. Of course, there will be additional quantum numbers like the principal quantum numbers in the case of the hydrogen atom, but also others typical for the many-body problem. In analogy to the hydrogen atom, terms with $l = 0, 1, 2, 3, \ldots$ are marked with the capital letters S, P, D, F, .... The lower index on the right indicates the $j$ value; the upper index on the left gives the value of the multiplicity of the multiplet. For example,
a doublet $P$ term, written $^2P_j$, has $l = 1$ and $s = \tfrac12$ (so that
$2 \times \tfrac12 + 1 = 2$); $F$ terms ($l = 3$) are labelled in the same way.
Strictly speaking, the index at the upper left is redundant, because the
multiplicity, always given by $2j + 1$, is immediately deduced from the lower
right index.
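The counting rules in (14.86) can be checked with a few lines of code. The following is only a minimal sketch: the function names `allowed_j` and `projections` are chosen here for illustration and do not appear in the text, and exact fractions are used so that half-integer values are represented without rounding.

```python
from fractions import Fraction


def allowed_j(l, s):
    """Return the j values of (14.86): l+s, l+s-1, ..., |l-s|."""
    l, s = Fraction(l), Fraction(s)
    j_min, j_max = abs(l - s), l + s
    # j_max - j_min = 2*min(l, s) is always a non-negative integer.
    count = int(j_max - j_min) + 1
    return [j_max - k for k in range(count)]


def projections(j):
    """Return the 2j+1 projections m = -j, -j+1, ..., j."""
    j = Fraction(j)
    return [-j + k for k in range(int(2 * j) + 1)]


if __name__ == "__main__":
    half = Fraction(1, 2)
    # l = 1, s = 1/2: two j values, i.e. 2s + 1 = 2 levels in the multiplet.
    print(allowed_j(1, half))
    # Each level with given j has 2j + 1 orientations.
    print(projections(Fraction(3, 2)))
```

Summing $2j + 1$ over all allowed $j$ reproduces $(2l + 1)(2s + 1)$, the number of independent orientations of $l$ and $s$, which is a convenient consistency check on the enumeration.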


EXAMPLE 


14.1 The Anomalous Zeeman Effect 


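As an illustrative example of the angular-momentum algebra, we consider level
splitting in a complicated atom with several electrons in a weak homogeneous
magnetic field (the anomalous Zeeman effect). The interaction of the electrons
with the external magnetic field $\boldsymbol{B}$ is given by

$$W = -\hat{\boldsymbol{\mu}} \cdot \boldsymbol{B} , \tag{1}$$

with the magnetic moment
$\hat{\boldsymbol{\mu}} = [e/(2m_e c)](\hat{\boldsymbol{L}} + 2\hat{\boldsymbol{S}})$;
$m_e$ is the electron mass. The anomalous $g$ factor of spin, i.e. $g = 2$, is
included. The coordinate system is chosen in such a way that the magnetic field
$\boldsymbol{B}$ and the $z$ axis are parallel. $\hat{\boldsymbol{L}}$ and
$\hat{\boldsymbol{S}}$ denote the total orbital angular momentum and total spin.
The operator of the magnetic moment can be expressed in terms of the total
angular momentum ($\hat{\boldsymbol{J}} = \hat{\boldsymbol{L}} + \hat{\boldsymbol{S}}$):

$$\hat{\boldsymbol{\mu}} = G\hat{\boldsymbol{J}} = \frac{e}{2m_e c}\,(\hat{\boldsymbol{L}} + 2\hat{\boldsymbol{S}}) = \frac{e}{2m_e c}\,(\hat{\boldsymbol{J}} + \hat{\boldsymbol{S}}) , \tag{2}$$

$$G = \frac{e}{2m_e c}\left\{ 1 + \frac{\hat{\boldsymbol{J}} \cdot \hat{\boldsymbol{S}}}{\hbar^2 j(j+1)} \right\} . \tag{3}$$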


Here, we have made the assumption that in states $|jm\rangle$ only the component
$[(\hat{\boldsymbol{S}} \cdot \hat{\boldsymbol{J}})/(\hbar^2 j(j+1))]\,\hat{\boldsymbol{J}}$
of the spin vector $\hat{\boldsymbol{S}}$, i.e. the component of
$\hat{\boldsymbol{S}}$ parallel to the vector of total angular momentum
$\hat{\boldsymbol{J}}$, yields a contribution. The normal component is, on
average, zero; this is valid for vector operators (the projection theorem)⁴.


3 We will give a formal derivation of this, using the commutation relations only, in
W. Greiner, B. Müller: Quantum Mechanics: Symmetries, 2nd rev. ed. (Springer, Berlin,
Heidelberg, New York 1994).


Fig. 14.4a-c. The addition of orbital ($l$) and spin ($s$) angular momentum.
(a) shows the maximal, and (c) the minimal, resultant total angular momentum;
(b) illustrates an intermediate case


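With $\hat{\boldsymbol{L}}^2 = \hat{\boldsymbol{J}}^2 + \hat{\boldsymbol{S}}^2 - 2\hat{\boldsymbol{S}} \cdot \hat{\boldsymbol{J}}$, we get

$$G = \frac{e}{2m_e c}\left\{ 1 + \frac{\hat{\boldsymbol{J}}^2 - \hat{\boldsymbol{L}}^2 + \hat{\boldsymbol{S}}^2}{2\hbar^2 j(j+1)} \right\} . \tag{4}$$

Because of the orientation of the magnetic field [$\boldsymbol{B} = (0, 0, B)$],
we only need $\hat{\mu}_z = G\hat{J}_z$.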


Let us assume that the magnetic field $B$ is weak enough for first-order
perturbation theory to be sufficient to calculate the effect of the magnetic
interaction (1). We calculate the matrix element of the interaction in a basis of
eigenfunctions $|jm\rangle$, in which the operators $G$ and $\hat{J}_z$ are
diagonal, so that

$$\langle N'j'l'm' |\hat{\mu}_z B| Njlm \rangle = \langle N'j'l'm' | G\hat{J}_z B | Njlm \rangle = \frac{m g e \hbar B}{2m_e c}\,\delta_{mm'}\delta_{ll'}\delta_{jj'} , \tag{5}$$

with $g$ being the so-called Landé factor. From (4) it follows that

$$g = 1 + \frac{j(j+1) - l(l+1) + s(s+1)}{2j(j+1)} . \tag{6}$$

From Exercise 12.1 we know the Larmor frequency $\omega_L = eB/(2m_e c)$, and
hence we obtain with (1), (5), and (6) the interaction energy of a particle
system in a magnetic field as

$$W = -\hbar m \omega_L \left\{ 1 + \frac{j(j+1) - l(l+1) + s(s+1)}{2j(j+1)} \right\} . \tag{7}$$

This means that in first-order perturbation theory the modification of the energy
levels is

$$E_{njm} = E_{nj} - \frac{e\hbar B}{2m_e c}\, g m , \tag{8}$$

so that the shift between two neighbouring levels ($\Delta m = 1$) is, in
magnitude,

$$\Delta E = \frac{e\hbar B}{2m_e c}\, g . \tag{9}$$


$\Delta E$ depends on the Landé factor (i.e. on $j$, $l$, and $s$) and on the
strength of the magnetic field. For states with total spin $s = 0$ and therefore
$j = l$ (singlet terms of atoms with an even number of electrons), we get
$g = 1$ and $\Delta E = e\hbar B/2m_e c$, which is the normal Zeeman effect.

Equation (9) is valid only for weak and homogeneous magnetic fields,
i.e. for field strengths $B$ which cause a splitting that is small compared
with the energy difference of the unperturbed levels (without fields). This
yields the condition

$$\left| \frac{e\hbar B}{2m_e c} \right| \ll |E_{nj} - E_{n'j'}| . \tag{10}$$
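The Landé factor (6) is easy to evaluate exactly. The sketch below is illustrative only: the function names `lande_g` and `zeeman_pattern` are assumptions made here, and the pattern is returned as the dimensionless products $gm$, i.e. in units of $e\hbar B/2m_e c$ and without the overall sign convention of (8).

```python
from fractions import Fraction


def lande_g(j, l, s):
    """Lande factor (6): g = 1 + [j(j+1) - l(l+1) + s(s+1)] / [2 j(j+1)]."""
    j, l, s = Fraction(j), Fraction(l), Fraction(s)
    if j == 0:
        # For j = 0 there is only m = 0, so no splitting and g is undefined.
        raise ValueError("g is undefined for j = 0")
    return 1 + (j * (j + 1) - l * (l + 1) + s * (s + 1)) / (2 * j * (j + 1))


def zeeman_pattern(j, l, s):
    """Return g*m for m = -j, ..., j (units of e*hbar*B / (2 m_e c))."""
    j = Fraction(j)
    g = lande_g(j, l, s)
    return [g * (-j + k) for k in range(int(2 * j) + 1)]


if __name__ == "__main__":
    half = Fraction(1, 2)
    print(lande_g(1, 1, 0))            # singlet term: normal Zeeman effect
    print(lande_g(half, 1, half))      # l = 1, s = 1/2, j = 1/2
    print(zeeman_pattern(Fraction(3, 2), 1, half))
```

For $s = 0$ the function returns $g = 1$ for every $l$, which is the normal Zeeman effect mentioned above; for $s \neq 0$ the spacing $g$ differs from level to level, which is what makes the effect "anomalous".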


4 See e.g. M.E. Rose: Elementary Theory of Angular Momentum (Wiley, New York 1957),
Chap. 20.


14.2 Centre-of-Mass Motion in Atoms


Problem. a) Take into account the motion of the nucleus in atoms; make use of 
the results obtained in the section on the centre-of-mass motion of a particle 
system. 

b) What are the modifications of the transition frequencies of the hydrogen
atom discussed in Chap. 9? In other words: What is the true value of the
Rydberg constant in the hydrogen atom?

c) Determine the electron mass, making use of the relations between 
transition frequency and reduced mass in atoms with one electron. 


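Solution. a) Taking into account the motion of the nucleus, the stationary
Schrödinger equation reads:

$$-\frac{\hbar^2}{2m_1}\left( \frac{\partial^2\psi}{\partial x_1^2} + \frac{\partial^2\psi}{\partial y_1^2} + \frac{\partial^2\psi}{\partial z_1^2} \right) - \frac{\hbar^2}{2m_2}\left( \frac{\partial^2\psi}{\partial x_2^2} + \frac{\partial^2\psi}{\partial y_2^2} + \frac{\partial^2\psi}{\partial z_2^2} \right) + V(r)\psi = E\psi(x_1, y_1, z_1, x_2, y_2, z_2) , \tag{1}$$

where $m_1$ is the mass of the nucleus with the coordinates $(x_1, y_1, z_1)$,
and $m_2$ the mass of the electron located at $(x_2, y_2, z_2)$. The relative
distance between the nucleus and the electron is

$$r = \sqrt{(x_1 - x_2)^2 + (y_1 - y_2)^2 + (z_1 - z_2)^2} \tag{2}$$

(see next figure).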

We introduce Jacobi coordinates, corresponding to our general considerations
of the centre-of-mass motion of a particle system (in the second section of
this chapter),

$$\xi_1 = x_1 - x_2 = x , \qquad \xi_2 = \frac{m_1 x_1 + m_2 x_2}{m_1 + m_2} = X ,$$

$$\eta_1 = y_1 - y_2 = y , \qquad \eta_2 = \frac{m_1 y_1 + m_2 y_2}{m_1 + m_2} = Y ,$$

$$\zeta_1 = z_1 - z_2 = z , \qquad \zeta_2 = \frac{m_1 z_1 + m_2 z_2}{m_1 + m_2} = Z , \tag{3}$$

applying the results for $N = 2$ [see (14.25)].

We see that the Jacobi coordinates with index 1 represent the relative
coordinates; those with index 2, the centre-of-mass coordinates of nucleus and
electron.

The transformed Schrödinger equation and its solution follow immediately
from (14.32):

$$-\frac{\hbar^2}{2M}\left( \frac{\partial^2\psi}{\partial X^2} + \frac{\partial^2\psi}{\partial Y^2} + \frac{\partial^2\psi}{\partial Z^2} \right) - \frac{\hbar^2}{2\mu}\left( \frac{\partial^2\psi}{\partial x^2} + \frac{\partial^2\psi}{\partial y^2} + \frac{\partial^2\psi}{\partial z^2} \right) + V(r)\psi = E\psi(X, Y, Z; x, y, z) , \tag{4}$$

with

$$M = m_1 + m_2 , \qquad \mu = \frac{m_1 m_2}{m_1 + m_2} . \tag{5}$$

Figure: The coordinates involved in this exercise: $\boldsymbol{r}_1$ points to
the nucleus, $\boldsymbol{r}_2$ to the electron


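For $\psi$ we choose a separation of variables by exploiting our knowledge of
a freely moving centre of mass, i.e. we use a plane wave for the centre-of-mass
motion:

$$\psi(X, Y, Z; x, y, z) = N \exp\left[ -\frac{i}{\hbar}\,(P_x X + P_y Y + P_z Z) \right] \varphi(x, y, z) , \tag{6}$$

$N$ being a normalization factor. Inserting (6) into (4) yields the Schrödinger
equation for the relative motion:

$$-\frac{\hbar^2}{2\mu}\left( \frac{\partial^2\varphi}{\partial x^2} + \frac{\partial^2\varphi}{\partial y^2} + \frac{\partial^2\varphi}{\partial z^2} \right) + V(r)\varphi = \varepsilon\varphi , \tag{7}$$

with

$$\varepsilon = E - \frac{P^2}{2M} . \tag{8}$$
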

Equation (7) describes the motion of a particle with mass $\mu$ (the reduced
mass) in a given force field $V(r)$. The quantity $\varepsilon$ stands for the
internal energy of the relative motion. $E$ is the total energy, which contains
the energy of the relative motion $\varepsilon$ and the kinetic energy $P^2/2M$
of the centre of mass. In the section on the spectrum of the hydrogen atom
(Chap. 9), we solved an equation analogous to (7), but on the assumption that
the mass of the nucleus is very large compared to the electron mass,
$m_1 \gg m_2$. Indeed, using (5), we see that then $\mu \approx m_2$. In the
case of the hydrogen atom, we found for the transition frequencies between the
principal quantum numbers $n$ and $n'$ that

$$\nu = R_\infty \left( \frac{1}{n'^2} - \frac{1}{n^2} \right) , \quad \text{with} \quad R_\infty = \frac{m_2 e^4}{4\pi\hbar^3} . \tag{9}$$

The desired values for $\varepsilon$ and $\varphi(x, y, z)$ correspond exactly
to the quantities derived for the hydrogen atom, if we write $\mu$ for $m_2$.

b) We have to replace $m_2$ by $\mu$ in order to obtain an accurate value for
the Rydberg constant. We then get for the hydrogen atom

$$R_H = \frac{\mu e^4}{4\pi\hbar^3} = \frac{e^4}{4\pi\hbar^3}\,\frac{m_1 m_2}{m_1 + m_2} . \tag{10}$$

For a nucleus $A$ of charge $Z$ and mass $m_A$, we have to replace the Coulomb
interaction in the hydrogen atom, $-e^2/r$, by $-Ze^2/r$; and we consequently
obtain for the modified transition frequencies

$$\nu = \frac{\mu_A Z^2 e^4}{4\pi\hbar^3}\left( \frac{1}{n'^2} - \frac{1}{n^2} \right) = Z^2 R_A \left( \frac{1}{n'^2} - \frac{1}{n^2} \right) , \tag{11}$$

where

$$R_A = \frac{\mu_A e^4}{4\pi\hbar^3} , \qquad \mu_A = \frac{m_A m_e}{m_A + m_e} .$$



c) The fact that $\mu$ assumes different values in different atoms was used by
Houston to determine the electron mass through precise measurements of the
$H_\alpha$ and $H_\beta$ lines of the hydrogen atom. The $H_\alpha$ and
$H_\beta$ lines belong to the Balmer series, i.e. transitions that end in the
$n = 2$ energy level (see Chap. 9). $H_\alpha$ describes the transition from
$n = 3$ to $n = 2$, $H_\beta$ the transition from $n = 4$ to $n = 2$.

The transition frequencies of the $H_\alpha$ lines can be determined in hydrogen
and He$^+$ (i.e. singly ionized helium with only one electron circling the He
nucleus):

$$\nu_H = R_H \left( \frac{1}{2^2} - \frac{1}{3^2} \right) = \frac{5}{36}\,R_H , \qquad \nu_{He} = 2^2 R_{He} \left( \frac{1}{2^2} - \frac{1}{3^2} \right) = \frac{20}{36}\,R_{He} . \tag{12}$$

From (12), a relation can be established between the frequencies, depending on
the reduced masses:

$$y = \frac{\tfrac14 \nu_{He} - \nu_H}{\nu_H} = \frac{\mu_{He} - \mu_H}{\mu_H} . \tag{13}$$

Expressing $\mu_H$ and $\mu_{He}$ by the masses $m_H$ and $m_{He}$ of the
hydrogen and helium nuclei,

$$\mu_H = \frac{m_H m_e}{m_H + m_e} , \qquad \mu_{He} = \frac{m_{He} m_e}{m_{He} + m_e} , \tag{14}$$

we obtain for $y$

$$y = \frac{(m_{He} - m_H)\, m_e}{(m_{He} + m_e)\, m_H} . \tag{15}$$

The spectroscopic determination of $y$ allows us to compute the ratio $m_e/m_H$
according to (15), i.e. the atomic mass of the electron, for given values of
$m_H$ and $m_{He}$.

Houston found the value

$$m_H/m_e = 1838.2 \pm 1.8 . \tag{16}$$


This method is also suitable for determining isotopic masses, because the
different reduced masses cause a line shift in a quantum transition. The mass
of the deuteron $m_D \approx 2m_H$, which contains one proton and one neutron,
was determined by this effect.
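The logic of part c) can be retraced numerically: compute the relative shift $y$ from (15) for given masses, then invert (15) to recover the electron mass from $y$. The sketch below is illustrative only. The function names are chosen here, and the numerical masses (in atomic mass units) are approximate modern values assumed for the demonstration; they are not Houston's data.

```python
def reduced_mass(m_nucleus, m_electron):
    """Reduced mass mu = m_nucleus * m_electron / (m_nucleus + m_electron)."""
    return m_nucleus * m_electron / (m_nucleus + m_electron)


def y_from_masses(m_h, m_he, m_e):
    """Relative line shift y of equation (15)."""
    return (m_he - m_h) * m_e / ((m_he + m_e) * m_h)


def electron_mass_from_y(y, m_h, m_he):
    """Solve equation (15) for the electron mass m_e."""
    return y * m_h * m_he / (m_he - m_h - y * m_h)


# Illustrative inputs in atomic mass units (approximate, assumed values).
M_H = 1.007276    # hydrogen nucleus (proton)
M_HE = 4.001506   # helium-4 nucleus
M_E = 5.4858e-4   # electron

y = y_from_masses(M_H, M_HE, M_E)
m_e_recovered = electron_mass_from_y(y, M_H, M_HE)
print(f"y = {y:.6e}")
print(f"m_H / m_e = {M_H / m_e_recovered:.1f}")
```

Because $y$ is only a few parts in $10^4$, the relative accuracy of the deduced electron mass is limited by the relative accuracy with which $y$ itself can be measured, which is why very precise wavelength measurements are needed.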


14.4 Small Oscillations in a Many-Particle System


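As the simplest many-particle system, we first consider two identical particles
oscillating with small amplitudes about their equilibrium position. In this case
we can expand the potential energy in a power series:

$$V(x_1, x_2) = V(0,0) + x_1 \left.\frac{\partial V}{\partial x_1}\right|_{x_1=x_2=0} + x_2 \left.\frac{\partial V}{\partial x_2}\right|_{x_1=x_2=0} + \frac12 x_1^2 \left.\frac{\partial^2 V}{\partial x_1^2}\right|_{x_1=x_2=0} + \frac12 x_2^2 \left.\frac{\partial^2 V}{\partial x_2^2}\right|_{x_1=x_2=0} + x_1 x_2 \left.\frac{\partial^2 V}{\partial x_1 \partial x_2}\right|_{x_1=x_2=0} + \cdots . \tag{14.87}$$

For a vanishing elongation, the potential energy is minimal (no forces exist in
the equilibrium position) and can be set equal to zero. It then follows that

$$V(0,0) = 0 , \qquad \left.\frac{\partial V}{\partial x_1}\right|_{x_1=x_2=0} = \left.\frac{\partial V}{\partial x_2}\right|_{x_1=x_2=0} = 0 . \tag{14.88}$$
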

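The knowledge of the one-particle oscillator potential (see Chap. 7) suggests
setting

$$\left.\frac{\partial^2 V}{\partial x_1^2}\right|_{x_1=x_2=0} = \left.\frac{\partial^2 V}{\partial x_2^2}\right|_{x_1=x_2=0} = \mu\omega_0^2 , \tag{14.89}$$

if we assume equal spring constants, masses $\mu$ and frequencies for the two
particles. The interaction energy of the particles is taken to be constant in
lowest order:

$$\left.\frac{\partial^2 V}{\partial x_1 \partial x_2}\right|_{x_1=x_2=0} = A . \tag{14.90}$$

We thus obtain in the case of small oscillations the following expression for the
potential:

$$V(x_1, x_2) = \frac{\mu\omega_0^2}{2}\,x_1^2 + \frac{\mu\omega_0^2}{2}\,x_2^2 + A x_1 x_2 . \tag{14.91}$$

The Hamiltonian of the system follows immediately as

$$\hat{H} = -\frac{\hbar^2}{2\mu}\left( \frac{\partial^2}{\partial x_1^2} + \frac{\partial^2}{\partial x_2^2} \right) + \frac{\mu\omega_0^2}{2}\,(x_1^2 + x_2^2) + A x_1 x_2 . \tag{14.92}$$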


In analogy to classical mechanics, we introduce normal coordinates $q_1$ and
$q_2$, so that the potential energy $V(x_1, x_2)$ can be represented by a sum of
terms quadratic in $q_1$ and $q_2$. The kinetic energy can be expressed by the
squares of the momenta $-i\hbar(\partial/\partial q_1)$,
$-i\hbar(\partial/\partial q_2)$. In general, normal coordinates are suitable
for describing the eigenoscillations (normal modes) of a system, in which the
restoring forces are proportional to the elongation of all particles, and the
potential energy is thus a quadratic form of the elongations.


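For the system under consideration we set

$$x_1 = \frac{1}{\sqrt2}\,(q_1 + q_2) , \qquad x_2 = \frac{1}{\sqrt2}\,(q_1 - q_2) . \tag{14.93}$$

Now $x_1$ and $x_2$ are expressed in terms of the normal coordinates. To
transform the kinetic energy as well, the derivatives

$$\frac{\partial\psi}{\partial q_1} = \frac{\partial\psi}{\partial x_1}\frac{\partial x_1}{\partial q_1} + \frac{\partial\psi}{\partial x_2}\frac{\partial x_2}{\partial q_1} = \frac{1}{\sqrt2}\left( \frac{\partial\psi}{\partial x_1} + \frac{\partial\psi}{\partial x_2} \right) , \qquad \frac{\partial^2\psi}{\partial q_1^2} = \frac12\left( \frac{\partial^2\psi}{\partial x_1^2} + 2\frac{\partial^2\psi}{\partial x_1 \partial x_2} + \frac{\partial^2\psi}{\partial x_2^2} \right) ,$$

and

$$\frac{\partial\psi}{\partial q_2} = \frac{1}{\sqrt2}\left( \frac{\partial\psi}{\partial x_1} - \frac{\partial\psi}{\partial x_2} \right) , \qquad \frac{\partial^2\psi}{\partial q_2^2} = \frac12\left( \frac{\partial^2\psi}{\partial x_1^2} - 2\frac{\partial^2\psi}{\partial x_1 \partial x_2} + \frac{\partial^2\psi}{\partial x_2^2} \right) \tag{14.94}$$

are needed. The potential energy becomes

$$\frac{\mu\omega_0^2}{2}\,(x_1^2 + x_2^2) + A x_1 x_2 = \frac{\mu\omega_0^2}{2}\,(q_1^2 + q_2^2) + \frac{A}{2}\,(q_1^2 - q_2^2) = \frac{\mu\omega_1^2}{2}\,q_1^2 + \frac{\mu\omega_2^2}{2}\,q_2^2 , \tag{14.95}$$

with

$$\mu\omega_1^2 = \mu\omega_0^2 + A , \qquad \mu\omega_2^2 = \mu\omega_0^2 - A . \tag{14.96}$$

We obtain the Hamiltonian in normal coordinates as

$$\hat{H} = -\frac{\hbar^2}{2\mu}\left( \frac{\partial^2}{\partial q_1^2} + \frac{\partial^2}{\partial q_2^2} \right) + \frac{\mu\omega_1^2}{2}\,q_1^2 + \frac{\mu\omega_2^2}{2}\,q_2^2 . \tag{14.97}$$

Obviously the Hamiltonian of two coupled oscillators (14.92) is transformed
into a sum of the Hamiltonians of uncoupled oscillators with frequencies
$\omega_1$ and $\omega_2$. The wave functions and the energies of the system are
obtained by solving the associated Schrödinger equation, which is

$$-\frac{\hbar^2}{2\mu}\frac{\partial^2\psi}{\partial q_1^2} + \frac{\mu\omega_1^2}{2}\,q_1^2\,\psi - \frac{\hbar^2}{2\mu}\frac{\partial^2\psi}{\partial q_2^2} + \frac{\mu\omega_2^2}{2}\,q_2^2\,\psi = E\psi . \tag{14.98}$$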


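Decoupling of this equation is achieved by the separation

$$\psi(q_1, q_2) = \psi_1(q_1)\,\psi_2(q_2) , \qquad E = E_1 + E_2 . \tag{14.99}$$

By introducing (14.99) into (14.98) and after dividing by
$\psi_1(q_1)\psi_2(q_2)$, separate terms depending solely on $q_1$ or on $q_2$
are obtained:

$$-\frac{\hbar^2}{2\mu}\frac{\partial^2\psi_1}{\partial q_1^2} + \frac{\mu\omega_1^2}{2}\,q_1^2\,\psi_1 = E_1\psi_1 , \tag{14.100}$$

$$-\frac{\hbar^2}{2\mu}\frac{\partial^2\psi_2}{\partial q_2^2} + \frac{\mu\omega_2^2}{2}\,q_2^2\,\psi_2 = E_2\psi_2 . \tag{14.101}$$

We already know the solution of (14.100) and (14.101) from Chap. 7. These two
equations describe harmonic oscillators with frequencies $\omega_1$ and
$\omega_2$, respectively, and the wave functions are given by Hermite
polynomials:

$$\psi_{n_1}(q_1) = \left[ 2^{n_1} n_1! \left( \pi/\lambda_1 \right)^{1/2} \right]^{-1/2} \exp\left( -\tfrac12 \lambda_1 q_1^2 \right) H_{n_1}\!\left( \sqrt{\lambda_1}\,q_1 \right) , \tag{14.102}$$

with $\lambda_1 = \mu\omega_1/\hbar$ and the energies

$$E_{n_1} = \hbar\omega_1 \left( n_1 + \tfrac12 \right) , \qquad n_1 = 0, 1, 2, \ldots , \tag{14.103}$$

and, analogously,

$$\psi_{n_2}(q_2) = \left[ 2^{n_2} n_2! \left( \pi/\lambda_2 \right)^{1/2} \right]^{-1/2} \exp\left( -\tfrac12 \lambda_2 q_2^2 \right) H_{n_2}\!\left( \sqrt{\lambda_2}\,q_2 \right) , \tag{14.104}$$

with $\lambda_2 = \mu\omega_2/\hbar$ and the energies

$$E_{n_2} = \hbar\omega_2 \left( n_2 + \tfrac12 \right) , \qquad n_2 = 0, 1, 2, \ldots . \tag{14.105}$$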


The eigenfunctions and energy eigenvalues of the whole system follow by
inserting the last results into (14.99):

$$\psi_{n_1 n_2}(q_1, q_2) = \psi_{n_1}(q_1)\,\psi_{n_2}(q_2) \tag{14.106}$$

and

$$E_{n_1 n_2} = \hbar\omega_1 \left( n_1 + \tfrac12 \right) + \hbar\omega_2 \left( n_2 + \tfrac12 \right) , \tag{14.107}$$

wherefrom we deduce the ground state energy of the system as

$$E_{00} = \frac{\hbar\omega_1}{2} + \frac{\hbar\omega_2}{2} . \tag{14.108}$$
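The decoupling of the two oscillators can be verified numerically: the potential written in $x_1, x_2$ must equal the sum of two uncoupled oscillator potentials in $q_1, q_2$. The following is a minimal sketch under stated assumptions: the function names are chosen here, `coupling` stands for the constant of (14.90), and units with $\hbar = 1$ are assumed by default.

```python
import math


def normal_mode_frequencies(mu, omega0, coupling):
    """Frequencies from (14.96): mu*w1^2 = mu*w0^2 + c, mu*w2^2 = mu*w0^2 - c."""
    if abs(coupling) >= mu * omega0 ** 2:
        # Otherwise one frequency would be zero or imaginary (no stable minimum).
        raise ValueError("coupling too strong for small oscillations")
    w1 = math.sqrt(omega0 ** 2 + coupling / mu)
    w2 = math.sqrt(omega0 ** 2 - coupling / mu)
    return w1, w2


def to_normal(x1, x2):
    """Invert (14.93): q1 = (x1 + x2)/sqrt(2), q2 = (x1 - x2)/sqrt(2)."""
    root2 = math.sqrt(2.0)
    return (x1 + x2) / root2, (x1 - x2) / root2


def potential_x(x1, x2, mu, omega0, coupling):
    """Coupled potential (14.91) in the original coordinates."""
    return 0.5 * mu * omega0 ** 2 * (x1 ** 2 + x2 ** 2) + coupling * x1 * x2


def potential_q(q1, q2, mu, omega0, coupling):
    """Uncoupled potential (14.95) in normal coordinates."""
    w1, w2 = normal_mode_frequencies(mu, omega0, coupling)
    return 0.5 * mu * (w1 ** 2 * q1 ** 2 + w2 ** 2 * q2 ** 2)


def energy(n1, n2, mu, omega0, coupling, hbar=1.0):
    """Energy eigenvalue (14.107)."""
    w1, w2 = normal_mode_frequencies(mu, omega0, coupling)
    return hbar * w1 * (n1 + 0.5) + hbar * w2 * (n2 + 0.5)


if __name__ == "__main__":
    mu, omega0, c = 1.0, 2.0, 1.5
    q1, q2 = to_normal(0.3, -0.7)
    print(potential_x(0.3, -0.7, mu, omega0, c), potential_q(q1, q2, mu, omega0, c))
    print(energy(0, 0, mu, omega0, c))
```

For vanishing coupling both frequencies reduce to $\omega_0$ and the ground-state energy (14.108) becomes $\hbar\omega_0$, the zero-point energy of two independent oscillators.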
Now we consider the probability of finding the normal coordinates $q_1$ and
$q_2$ in the intervals $(q_1, q_1 + dq_1)$ and $(q_2, q_2 + dq_2)$, with the aim
of making a statement concerning the coordinates $x_1, x_2$ of configuration
space. The probability mentioned is described by

$$w(q_1, q_2)\,dq_1 dq_2 = |\psi_{n_1 n_2}(q_1, q_2)|^2\,dq_1 dq_2 . \tag{14.109}$$


Correspondingly, the probability of finding the system in the coordinate space
$x_1, x_2$ in the intervals $(x_1, x_1 + dx_1)$ and $(x_2, x_2 + dx_2)$ follows
by using

$$dq_1 dq_2 = \frac{\partial(q_1, q_2)}{\partial(x_1, x_2)}\,dx_1 dx_2 = -dx_1 dx_2 . \tag{14.110}$$

This implies that we have to reverse the direction of revolution of the region
$G^*(q_1, q_2)$ with respect to that of the region $G(x_1, x_2)$ if a
transformation of a surface integral from $G(x_1, x_2)$ to $G^*(q_1, q_2)$ is
carried out (negative sign of the functional determinant!). The surface element,
of course, remains positive⁵, so that the probability is

$$w(x_1, x_2)\,dx_1 dx_2 = \left| \psi_{n_1 n_2}\!\left( \frac{x_1 + x_2}{\sqrt2}, \frac{x_1 - x_2}{\sqrt2} \right) \right|^2 dx_1 dx_2 . \tag{14.111}$$


5 Since volume elements are required to be positive, we should define the transformation
(14.110) from one volume element to another one using the absolute value of the
transformation determinant.



The generalization of these results to an $N$-particle system performing small
oscillations about its equilibrium is straightforward.

We denote the elongation of the $k$th particle by $x_k, y_k, z_k$ and obtain the
potential energy

$$V = \frac12 \sum_{i,j=1}^{3N} C_{ij} w_i w_j = \frac12\, w^T C w , \tag{14.112}$$

where

$$w = (w_i) = (x_1, x_2, \ldots, x_N;\ y_1, \ldots, y_N;\ z_1, z_2, \ldots, z_N) \tag{14.113}$$

stands for the position vector in the configuration space of all $N$ particles.
In analogy to (14.87) and (14.90), the coefficients $C = (C_{ij})$ are the
second-order derivatives of the potential energy:

$$C_{ij} = \left.\frac{\partial^2 V}{\partial w_i \partial w_j}\right|_{w=0} \quad \text{for } i \neq j , \tag{14.114}$$

$$C_{ii} = \left.\frac{\partial^2 V}{\partial w_i^2}\right|_{w=0} \quad \text{for } i = j . \tag{14.115}$$

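As in the previously discussed simple example, we can now introduce $3N$ normal
coordinates $q_s$, $s = 1, 2, \ldots, 3N$, which are related to the Cartesian
coordinates by an orthogonal transformation:

$$q_s = \sum_{l=1}^{3N} a_{sl} w_l \tag{14.116}$$

and

$$\sum_s a_{sl} a_{sm} = \delta_{lm} = \sum_s a_{ls} a_{ms} . \tag{14.117}$$

$\hat{a} = (a_{sl})$ is a matrix and its inverse $\hat{a}^{-1}$ is equal to its
transpose, $\hat{a}^{-1} = \hat{a}^T$, or to its Hermitian conjugate, if
$\hat{a}$ contains complex elements:

$$(\hat{a}^{-1}\hat{a})_{lm} = \sum_k a^*_{kl} a_{km} = \delta_{lm} , \tag{14.118}$$

from which it follows, together with (14.116), that

$$w_l = \sum_{s=1}^{3N} a_{sl} q_s . \tag{14.119}$$

Since $\hat{a}$ is an orthogonal matrix, the terms of the operator of the kinetic
energy also decouple in normal coordinates, if all particles have the same mass
$\mu$:

$$\frac{\partial\psi}{\partial q_s} = \sum_{l=1}^{3N} \frac{\partial\psi}{\partial w_l}\frac{\partial w_l}{\partial q_s} = \sum_{l=1}^{3N} \frac{\partial\psi}{\partial w_l}\,a_{sl} ,$$

$$\frac{\partial^2\psi}{\partial q_s^2} = \sum_{m=1}^{3N} \frac{\partial}{\partial w_m}\left( \sum_{l=1}^{3N} \frac{\partial\psi}{\partial w_l}\,a_{sl} \right) a_{sm} = \sum_{l,m} \frac{\partial^2\psi}{\partial w_l \partial w_m}\,a_{sl} a_{sm} . \tag{14.120}$$

Now, using the orthogonality of $\hat{a}$ expressed in (14.117) and (14.118),
the kinetic energy is calculated as follows:

$$-\frac{\hbar^2}{2\mu} \sum_{s=1}^{3N} \frac{\partial^2}{\partial q_s^2} = -\frac{\hbar^2}{2\mu} \sum_{s=1}^{3N} \sum_{l,m} a_{sl} a_{sm} \frac{\partial^2}{\partial w_l \partial w_m} = -\frac{\hbar^2}{2\mu} \sum_{l,m} \delta_{lm} \frac{\partial^2}{\partial w_l \partial w_m} = -\frac{\hbar^2}{2\mu} \sum_{l=1}^{3N} \frac{\partial^2}{\partial w_l^2} = -\frac{\hbar^2}{2\mu} \sum_{k=1}^{N} \nabla_k^2 . \tag{14.121}$$

The index $k$ at the gradient operator

$$\nabla_k = \left( \frac{\partial}{\partial x_k}, \frac{\partial}{\partial y_k}, \frac{\partial}{\partial z_k} \right)$$

refers to the particle number $k$, as in (14.113). The potential energy is
assumed to be a bilinear form of the coordinates $w_i$ and $w_j$:

$$V = \frac12 \sum_{i,j} C_{ij} w_i w_j = \frac12\, w^T C w , \tag{14.122}$$

so that

$$V = \frac12\, q^T \hat{a}\, C\, \hat{a}^T q . \tag{14.123}$$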


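To decouple the potential energy in normal coordinates, we require that

$$\hat{a}\, C\, \hat{a}^T = \Lambda , \tag{14.124}$$

where $\Lambda$ is a diagonal matrix of the form

$$\Lambda = \mathrm{diag}(\Lambda_{11}, \Lambda_{22}, \ldots, \Lambda_{3N\,3N}) , \qquad \Lambda_{ij} = \Lambda_{ii}\,\delta_{ij} .$$

Since $C$ is a symmetric matrix according to (14.114) and (14.115), it is
possible to construct an orthogonal matrix $\hat{a}$ in such a way that
$\Lambda$ is a real diagonal matrix.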
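As a check, the two-particle example treated earlier in this section is the simplest instance of this diagonalization. The worked step below is a sketch in the notation assumed here: $A$ denotes the coupling constant of (14.90), $\hat{a}$ the orthogonal matrix of the transformation (14.93), and $\Lambda$ the resulting diagonal matrix.

```latex
C = \begin{pmatrix} \mu\omega_0^2 & A \\ A & \mu\omega_0^2 \end{pmatrix} ,
\qquad
\hat{a} = \frac{1}{\sqrt{2}} \begin{pmatrix} 1 & 1 \\ 1 & -1 \end{pmatrix}
        = \hat{a}^{T} = \hat{a}^{-1} ,

\hat{a}\, C\, \hat{a}^{T}
  = \frac{1}{2}
    \begin{pmatrix} 1 & 1 \\ 1 & -1 \end{pmatrix}
    \begin{pmatrix} \mu\omega_0^2 & A \\ A & \mu\omega_0^2 \end{pmatrix}
    \begin{pmatrix} 1 & 1 \\ 1 & -1 \end{pmatrix}
  = \begin{pmatrix} \mu\omega_0^2 + A & 0 \\ 0 & \mu\omega_0^2 - A \end{pmatrix}
  = \begin{pmatrix} \mu\omega_1^2 & 0 \\ 0 & \mu\omega_2^2 \end{pmatrix} .
```

The diagonal elements reproduce the frequencies of (14.96), and the determinant of $\hat{a}$ is $-1$, in agreement with the sign of the functional determinant in (14.110).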


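With (14.121) and (14.123), the Hamiltonian of the coupled system splits up
into a sum of harmonic oscillator Hamiltonians, namely

$$\hat{H} = -\frac{\hbar^2}{2\mu} \sum_{k=1}^{N} \nabla_k^2 + \frac12 \sum_{i,j=1}^{3N} C_{ij} w_i w_j = -\frac{\hbar^2}{2\mu} \sum_{s=1}^{3N} \frac{\partial^2}{\partial q_s^2} + \frac12\,\mu \sum_{s=1}^{3N} \omega_s^2 q_s^2 , \tag{14.125}$$

where we have renamed the diagonal elements of $\Lambda$:

$$\Lambda_{ss} = \mu\omega_s^2 . \tag{14.126}$$

Now the Schrödinger equation for stationary states reads

$$\sum_{s=1}^{3N} \left[ -\frac{\hbar^2}{2\mu}\frac{\partial^2}{\partial q_s^2} + \frac12\,\mu\omega_s^2 q_s^2 \right] \Psi(q_1, q_2, \ldots, q_{3N}) = E\,\Psi(q_1, q_2, \ldots, q_{3N}) . \tag{14.127}$$

As an expression for $\Psi$, we choose in analogy to similar separation problems

$$\Psi = \Phi_1(q_1)\,\Phi_2(q_2)\,\Phi_3(q_3) \cdots \Phi_{3N}(q_{3N}) , \tag{14.128}$$

so that (14.127) decouples into $3N$ equations, which describe the same number
of independent oscillators. The equation for the oscillator with the $s$th
normal coordinate reads

$$-\frac{\hbar^2}{2\mu}\frac{\partial^2\Phi_s(q_s)}{\partial q_s^2} + \frac12\,\mu\omega_s^2 q_s^2\,\Phi_s(q_s) = E_s\,\Phi_s(q_s) . \tag{14.129}$$

The solution of (14.129) is, in analogy to (14.102), of the form

$$\Phi_{n_s}(q_s) = \left[ 2^{n_s} n_s! \left( \pi/\lambda_s \right)^{1/2} \right]^{-1/2} \exp\left( -\tfrac12 \lambda_s q_s^2 \right) H_{n_s}\!\left( \sqrt{\lambda_s}\,q_s \right) , \tag{14.130}$$

where $\lambda_s = \mu\omega_s/\hbar$. The energy eigenvalues are

$$E_{n_s} = \hbar\omega_s \left( n_s + \tfrac12 \right) , \qquad n_s = 0, 1, 2, \ldots , \tag{14.131}$$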


so that the total wave function can be written as

$$\Psi_{n_1, n_2, \ldots, n_{3N}}(q_1, q_2, \ldots, q_{3N}) = \Phi_{n_1}(q_1)\,\Phi_{n_2}(q_2)\,\Phi_{n_3}(q_3) \cdots \Phi_{n_{3N}}(q_{3N}) , \tag{14.132}$$

with the total energy

$$E_{n_1, n_2, \ldots, n_{3N}} = \hbar\omega_1 \left( n_1 + \tfrac12 \right) + \hbar\omega_2 \left( n_2 + \tfrac12 \right) + \cdots + \hbar\omega_s \left( n_s + \tfrac12 \right) + \cdots + \hbar\omega_{3N} \left( n_{3N} + \tfrac12 \right) . \tag{14.133}$$

The quantum numbers $n_1, \ldots, n_{3N}$ range over all non-negative integers.
As zero-point energy of the system, we obviously get

$$E_0 = \frac12\,\hbar \sum_{s=1}^{3N} \omega_s . \tag{14.134}$$

The energy levels of the oscillating particle system are obtained by inserting all
allowed combinations of oscillator quantum numbers $n_1, \ldots, n_{3N}$. In
this case it is sufficient to know the frequencies $\omega_s$ of the normal
oscillations. As these results were obtained for oscillations with small
amplitudes, (14.133) is only valid for the low-energy range of the energy
spectrum, i.e. for small quantum numbers $n_s$.
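Once the normal-mode frequencies are known, the spectrum (14.133) and the zero-point energy (14.134) follow by simple summation. The sketch below is illustrative: the function names and the cutoff parameter `n_max` are assumptions introduced here, and $\hbar = 1$ is used by default.

```python
from itertools import product


def zero_point_energy(omegas, hbar=1.0):
    """Zero-point energy (14.134): (hbar/2) * sum of all mode frequencies."""
    return 0.5 * hbar * sum(omegas)


def level_energy(ns, omegas, hbar=1.0):
    """Total energy (14.133) for quantum numbers ns = (n_1, ..., n_3N)."""
    if len(ns) != len(omegas):
        raise ValueError("need one quantum number per normal mode")
    if any(n < 0 for n in ns):
        raise ValueError("oscillator quantum numbers must be non-negative")
    return hbar * sum(w * (n + 0.5) for n, w in zip(ns, omegas))


def lowest_levels(omegas, n_max, count, hbar=1.0):
    """Return the `count` lowest (energy, quantum numbers) pairs.

    Only quantum numbers 0..n_max per mode are enumerated, so n_max must be
    chosen large enough that no lower-lying level is cut off.
    """
    levels = [
        (level_energy(ns, omegas, hbar), ns)
        for ns in product(range(n_max + 1), repeat=len(omegas))
    ]
    levels.sort()
    return levels[:count]


if __name__ == "__main__":
    omegas = [1.0, 2.0]
    print(zero_point_energy(omegas))
    for e, ns in lowest_levels(omegas, n_max=2, count=4):
        print(e, ns)
```

The number of enumerated states grows as $(n_{\max} + 1)^{3N}$, so a brute-force listing of this kind is only practical for a handful of modes.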

Such a physical situation can be found, for example, in molecules and solids
where the atoms oscillate with small amplitudes about their equilibrium
position so that an energy spectrum of the form (14.133) is obtained.

For larger amplitudes of oscillation, we have to take higher-order terms in
the Taylor series of the potential into account, such as

$$\left.\frac{\partial^3 V}{\partial x_i\,\partial y_j\,\partial z_k}\right|_{0} x_i\, y_j\, z_k . \tag{14.135}$$

In this case, a linear force law no longer applies, i.e. the potential energy is not
a quadratic form of the displacements, and hence the oscillations will no longer
decouple when normal coordinates are introduced. Under these circumstances,
our results are only approximately valid.




14.3 Two Particles in an External Field 


Problem. Calculate the influence of an external field on the motion of an
interacting two-particle system with the masses $m_1$ and $m_2$. Let the
potential energies of the first and second particles in the external field be
$V_1(x_1, y_1, z_1)$ and $V_2(x_2, y_2, z_2)$, respectively; let the interaction
between the particles be $W(x_1 - x_2, y_1 - y_2, z_1 - z_2)$.


Hints. a) Determine the time-dependent Schrédinger equation of the system in 
centre-of-mass and relative coordinates. 

b) Assume the dimension of the system to be small, so that the external 
potentials can be expanded around the centre of mass in terms of the internal 
(relative) coordinates. 

c) Expand the total wave function in a basis of wave functions $\Phi_n$, which
are undisturbed by the external fields. This basis shall describe the relative
motion. The coupling of the basis to the centre-of-mass coordinates is assumed
to be weak, or, equivalently, it can be treated as a perturbation.


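Solution. a) Let the mass and the coordinates of the first particle be $m_1$ and
$(x_1, y_1, z_1)$, those of the second particle $m_2$ and $(x_2, y_2, z_2)$. The
interaction energy between the particles is of the form
$W(x_1 - x_2, y_1 - y_2, z_1 - z_2)$; the potential energy of the single
particles in the external field is $V_1(x_1, y_1, z_1)$ and
$V_2(x_2, y_2, z_2)$, respectively. Hence, the Schrödinger equation of the
system is

$$i\hbar\,\frac{\partial\Psi}{\partial t} = -\frac{\hbar^2}{2m_1}\nabla_1^2\Psi - \frac{\hbar^2}{2m_2}\nabla_2^2\Psi + \left[ V_1(x_1, y_1, z_1) + V_2(x_2, y_2, z_2) + W(x_1 - x_2, y_1 - y_2, z_1 - z_2) \right]\Psi , \tag{1}$$

where $\Psi = \Psi(x_1, y_1, z_1, x_2, y_2, z_2, t)$. Instead of the particle
coordinates $x_1, y_1, z_1$ and $x_2, y_2, z_2$, we introduce centre-of-mass and
relative coordinates

$$X = \frac{m_1 x_1 + m_2 x_2}{m_1 + m_2} , \qquad x = x_1 - x_2 ,$$

$$Y = \frac{m_1 y_1 + m_2 y_2}{m_1 + m_2} , \qquad y = y_1 - y_2 ,$$

$$Z = \frac{m_1 z_1 + m_2 z_2}{m_1 + m_2} , \qquad z = z_1 - z_2 . \tag{2}$$

The coordinates of the particles $x_1, y_1, z_1$ and $x_2, y_2, z_2$ can be
expressed in terms of these new coordinates:

$$x_1 = X + \gamma x , \qquad x_2 = X - \delta x ,$$

$$y_1 = Y + \gamma y , \qquad y_2 = Y - \delta y ,$$

$$z_1 = Z + \gamma z , \qquad z_2 = Z - \delta z , \tag{3}$$

with

$$\gamma = \frac{m_2}{m_1 + m_2} , \qquad \delta = \frac{m_1}{m_1 + m_2} . \tag{4}$$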
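Now the Laplace operators are expressed in terms of relative and centre-of-mass
coordinates. Therefore we need

$$\frac{\partial\Psi}{\partial x_1} = \frac{\partial\Psi}{\partial X}\frac{\partial X}{\partial x_1} + \frac{\partial\Psi}{\partial x}\frac{\partial x}{\partial x_1} = \delta\,\frac{\partial\Psi}{\partial X} + \frac{\partial\Psi}{\partial x} ,$$

$$\frac{\partial^2\Psi}{\partial x_1^2} = \delta^2\,\frac{\partial^2\Psi}{\partial X^2} + 2\delta\,\frac{\partial^2\Psi}{\partial X \partial x} + \frac{\partial^2\Psi}{\partial x^2} . \tag{5}$$

Analogously, we find that

$$\frac{\partial^2\Psi}{\partial x_2^2} = \gamma^2\,\frac{\partial^2\Psi}{\partial X^2} - 2\gamma\,\frac{\partial^2\Psi}{\partial X \partial x} + \frac{\partial^2\Psi}{\partial x^2} , \tag{6}$$

from which we get

$$-\frac{\hbar^2}{2m_1}\frac{\partial^2\Psi}{\partial x_1^2} - \frac{\hbar^2}{2m_2}\frac{\partial^2\Psi}{\partial x_2^2} = -\frac{\hbar^2}{2(m_1 + m_2)}\frac{\partial^2\Psi}{\partial X^2} - \frac{\hbar^2}{2\,\dfrac{m_1 m_2}{m_1 + m_2}}\frac{\partial^2\Psi}{\partial x^2} . \tag{7}$$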
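The step from the second derivatives with respect to $x_1$ and $x_2$ to the separated form (7) rests on three short identities. The lines below are a worked sketch in which $\delta = m_1/(m_1 + m_2)$ and $\gamma = m_2/(m_1 + m_2)$ are the mass fractions that arise as $\partial X/\partial x_1$ and $\partial X/\partial x_2$, and $M = m_1 + m_2$, $\mu = m_1 m_2/(m_1 + m_2)$ are the total and reduced mass.

```latex
% coefficient of \partial^2\Psi/\partial X^2 :
\frac{\delta^2}{m_1} + \frac{\gamma^2}{m_2}
  = \frac{m_1}{M^2} + \frac{m_2}{M^2} = \frac{1}{M} ,

% coefficient of the mixed derivative \partial^2\Psi/\partial X\,\partial x :
\frac{2\delta}{m_1} - \frac{2\gamma}{m_2}
  = \frac{2}{M} - \frac{2}{M} = 0 ,

% coefficient of \partial^2\Psi/\partial x^2 :
\frac{1}{m_1} + \frac{1}{m_2} = \frac{m_1 + m_2}{m_1 m_2} = \frac{1}{\mu} .
```

The vanishing of the mixed term is what makes centre-of-mass and relative coordinates the natural choice: the kinetic energy separates exactly, and any remaining coupling can only come from the potentials.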


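Using the analogous expressions for the $y$ and $z$ components, and (3), we find
for the Schrödinger equation (1)

$$i\hbar\,\frac{\partial\Psi}{\partial t} = -\frac{\hbar^2}{2M}\nabla_R^2\Psi - \frac{\hbar^2}{2\mu}\nabla_r^2\Psi + \left[ V_1(X + \gamma x, Y + \gamma y, Z + \gamma z) + V_2(X - \delta x, Y - \delta y, Z - \delta z) + W(x, y, z) \right]\Psi , \tag{8}$$

with the Laplace operators

$$\nabla_R^2 = \frac{\partial^2}{\partial X^2} + \frac{\partial^2}{\partial Y^2} + \frac{\partial^2}{\partial Z^2} , \qquad \nabla_r^2 = \frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2} , \tag{9}$$

and the total mass $M = m_1 + m_2$, as well as the reduced mass
$\mu = (m_1 m_2)/(m_1 + m_2)$.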

The separation of variables, by now familiar to us, does not work in this
particular case because the potentials $V_1$ and $V_2$ prevent a decoupling of
the centre-of-mass coordinates, making our considerations more complicated.

b) To proceed analytically with our problem, we assume the extension of
the system to be very small. This implies a restriction to systems and states
for which the wave function $\Psi$ decreases sufficiently fast with increasing
relative distance $r = (x^2 + y^2 + z^2)^{1/2}$. A typical distance $a$, at
which the particle probability should approximately be zero, is the spatial
extension of the system, for example the expectation value of the radius of the
valence electron in an atom or the longitudinal extension of a molecule.

Under this assumption, a consideration of (8) within the range $r < a$ is
sufficient and we expand the potentials $V_1$ and $V_2$ with respect to powers
of $x, y, z$. This gives


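$$V_1(X + \gamma x, Y + \gamma y, Z + \gamma z) + V_2(X - \delta x, Y - \delta y, Z - \delta z)$$
$$\quad = V_1(X, Y, Z) + V_2(X, Y, Z) + \frac{\partial V_1}{\partial X}\,\gamma x + \cdots - \frac{\partial V_2}{\partial Z}\,\delta z + \cdots$$
$$\quad = V(X, Y, Z) + w(X, Y, Z, x, y, z) . \tag{10}$$

The term $V(X, Y, Z)$ denotes the potential energy of the centre of mass;
$w(X, Y, Z, x, y, z)$ couples the centre-of-mass motion to the relative motion.
With (10), the Schrödinger equation (8) can be cast in the form

$$i\hbar\,\frac{\partial\Psi}{\partial t} = \left[ -\frac{\hbar^2}{2M}\nabla_R^2 + V(X, Y, Z) \right]\Psi + \left[ -\frac{\hbar^2}{2\mu}\nabla_r^2 + W(x, y, z) + w(X, Y, Z, x, y, z) \right]\Psi . \tag{11}$$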

c) If the external field is absent, the eigenfunctions of the internal motion are
denoted by $\Phi_n^0(x, y, z)$, with the energy eigenvalues $E_n^0$. The
following equation is valid for these eigenfunctions $\Phi_n^0$:

$$-\frac{\hbar^2}{2\mu}\nabla_r^2\Phi_n^0 + W(x, y, z)\,\Phi_n^0 = E_n^0\,\Phi_n^0 . \tag{12}$$

The influence of the external field on the inner degrees of freedom of the system
is taken into account by the term $w(X, Y, Z, x, y, z)$, so that

$$-\frac{\hbar^2}{2\mu}\nabla_r^2\Phi_n + W(x, y, z)\,\Phi_n + w(X, Y, Z, x, y, z)\,\Phi_n = E_n\Phi_n . \tag{13}$$

The centre-of-mass coordinates appear as parameters in the coupling potential of
(13). Hence, the wave functions and energy eigenvalues will also depend on the
centre-of-mass coordinates.


If $w(X, Y, Z, x, y, z) \ll W(x, y, z)$, then the coupling potential can be
considered as a perturbation. If the solutions $\Phi_n^0$ of the free
interacting system (12) are known, (13) can be solved. The eigenfunctions and
eigenenergies of (13) then are

$$\Phi_n = \Phi_n(x, y, z; X, Y, Z) , \qquad E_n = E_n(X, Y, Z) . \tag{14}$$

As already mentioned, the centre-of-mass coordinates $X, Y, Z$ are only
parameters here. The total wave function $\Psi$ in (11) is now expanded with
respect to the stationary states $\Phi_n$:

$$\Psi(X, Y, Z, x, y, z, t) = \sum_n a_n(X, Y, Z, t)\,\Phi_n(x, y, z; X, Y, Z) . \tag{15}$$


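Inserting this into (11), we obtain a system of coupled differential equations
for the expansion coefficients $a_n$:

$$i\hbar\,\frac{\partial}{\partial t}\left( \sum_n a_n\Phi_n \right) = \left[ -\frac{\hbar^2}{2M}\nabla_R^2 + V(X, Y, Z) \right] \sum_n a_n\Phi_n + \left[ -\frac{\hbar^2}{2\mu}\nabla_r^2 + W(x, y, z) \right] \sum_n a_n\Phi_n + w(X, Y, Z, x, y, z) \sum_n a_n\Phi_n . \tag{16}$$

It can also be written as

$$i\hbar \sum_n \dot{a}_n\Phi_n = -\frac{\hbar^2}{2M}\left[ 2\sum_n (\nabla_R a_n)\cdot(\nabla_R\Phi_n) + \sum_n (\nabla_R^2 a_n)\,\Phi_n + \sum_n a_n (\nabla_R^2\Phi_n) \right] + V \sum_n a_n\Phi_n - \frac{\hbar^2}{2\mu} \sum_n a_n (\nabla_r^2\Phi_n) + W \sum_n a_n\Phi_n + w \sum_n a_n\Phi_n . \tag{17}$$

With the help of (13), the last three terms in (17) can be seen to be identical
with $\sum_n a_n E_n \Phi_n$. Multiplying (17) from the left by $\Phi_m^*$ and
integrating over $x, y, z$ yields

$$i\hbar\,\dot{a}_m = -\frac{\hbar^2}{2M} \sum_n 2\langle\Phi_m|\nabla_R|\Phi_n\rangle \cdot \nabla_R a_n - \frac{\hbar^2}{2M}\nabla_R^2 a_m - \frac{\hbar^2}{2M} \sum_n \langle\Phi_m|\nabla_R^2|\Phi_n\rangle\,a_n + V a_m + E_m a_m . \tag{18}$$

The matrix elements $\langle\Phi_m|\nabla_R|\Phi_n\rangle$ and
$\langle\Phi_m|\nabla_R^2|\Phi_n\rangle$ are nonzero only if the wave function
$\Phi_n$ depends on the centre-of-mass coordinates. In this case, a transition
of the system from the state $n$ to another state $m$ is possible according to
the transition matrix elements.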

If the system is prepared in the state $i$ at time $t = 0$, i.e.
$a_i(t = 0) \neq 0$ and $a_n(t = 0) = 0$ for all $n \neq i$, then, according to
(18), the time derivatives $\dot{a}_n(t = 0)$ with $n \neq i$ are in general
nonzero as well. Time evolution causes the pure state

$$\Psi = a_i\,\Phi_i(x, y, z; X, Y, Z)$$

to become a superposition according to (15).

Figure: A small cluster moving through an external field. The cluster acts like
an elementary particle in any internal state $\Phi_n$ as long as the
polarization effects (interaction of the internal degrees of freedom with the
centre-of-mass motion) are negligible

If the basis wave functions depend only weakly on the centre-of-mass
coordinates $X, Y, Z$, we can, as an approximation, neglect the transition matrix
elements and find

$$i\hbar\,\frac{\partial a_n}{\partial t} = -\frac{\hbar^2}{2M}\nabla_R^2 a_n + (V + E_n)\,a_n . \tag{19}$$

So the expansion amplitudes $a_n$ follow the equations of motion of the centre
of mass in a potential field of the form

$$V(X, Y, Z) + E_n(X, Y, Z) , \tag{20}$$

which depends on $E_n$. This corresponds to the condition that the inner state of
the system be the $n$th quantum state. For each $n$, (19), within the
approximation chosen, can be interpreted as the motion of a massive point
particle. In other words, the whole system propagates for each internal state
$\Phi_n$ in a slightly modified potential field (see next figure). This is
quite reasonable.


14.5 Biographical Notes 
LANDÉ, Alfred, German-American physicist, *Elberfeld 13.12.1888, †30.10.1975.
L. was professor at Tübingen from 1922 to 1931, and thereafter at Ohio State
University, Columbus, Ohio. He developed the systematics of the multiplet
spectra (1921-1923) and the Zeeman effect ("Landésches Vektormodell"). He also
introduced the g factor named after him.


15. Identical Particles 


One characteristic of quantum mechanics is the indistinguishability of identical
particles in the subatomic region. We designate as identical particles those
particles that have the same mass, charge, spin etc. and behave in the same
manner under equal physical conditions. Therefore, in contrast with macroscopic
objects, it is not possible to distinguish between particles like electrons
(protons, pions, $\alpha$ particles) on the basis of their characteristics or
their trajectory. The spreading of the wave packets that describe the particles
leads to an overlapping of the probability densities in time (Fig. 15.1); thus
we will not be able to establish later on whether particle no. 1 or no. 2 or
another particle can be found at the point in space $\boldsymbol{r}$. Because
of the possible interaction (momentum exchange etc.), dynamical properties
cannot be used to distinguish between them, either.

If we regard a quantum-mechanical system of identical particles, we will not
be able to relate a state $\psi_n$ to particle no. $n$; we will only be able to
determine the state of the totality of all particles.

In the case of a system of $N$ particles with spin, the wave function of the
system is a function of $4N$ coordinates ($3N$ space and $N$ spin coordinates):

$$\psi = \psi(\boldsymbol{r}_1 s_1, \boldsymbol{r}_2 s_2, \ldots, \boldsymbol{r}_N s_N, t) . \tag{15.1}$$


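Since the system consists of identical particles, the physical state remains the
same if the particles $i$ and $j$ are exchanged. This operation is carried out
by the operator $\hat{P}_{ij}$:

$$\hat{P}_{ij}\,\psi(\boldsymbol{r}_1 s_1, \ldots, \boldsymbol{r}_i s_i, \ldots, \boldsymbol{r}_j s_j, \ldots, \boldsymbol{r}_N s_N, t) = \psi(\boldsymbol{r}_1 s_1, \ldots, \boldsymbol{r}_j s_j, \ldots, \boldsymbol{r}_i s_i, \ldots, \boldsymbol{r}_N s_N, t) = \lambda\,\psi(\boldsymbol{r}_1 s_1, \ldots, \boldsymbol{r}_i s_i, \ldots, \boldsymbol{r}_j s_j, \ldots, \boldsymbol{r}_N s_N, t) , \tag{15.2}$$

where, for the present, $\lambda$ is an arbitrary constant factor. A second
exchange of the two particles recreates the original state. Hence,

$$\hat{P}_{ij}^2\,\psi = \lambda^2\psi = \psi , \tag{15.3}$$

yielding two values for $\lambda$:

$$\lambda = \pm 1 . \tag{15.4}$$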


Since we are examining systems of identical particles, the exchange of particles
always acts in the same way on the wave function. This means that two kinds of
particle systems may exist: systems with wave functions that change sign upon
the exchange of two particles, and systems whose wave functions remain
unchanged. Therefore either

$$\hat{P}_{ij}\,\psi_s = \psi_s \quad \text{or} \quad \hat{P}_{ij}\,\psi_a = -\psi_a \tag{15.5}$$

holds.


Fig. 15.1. Overlapping of probability densities (schematic). Originally the wave
packets for particles no. 1 and no. 2 are prepared separately. As time evolution
of the packets proceeds, they overlap (doubly shaded area) and it is no longer
possible to distinguish the particles



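We call the wave function $\psi_s$ with the eigenvalue $+1$ symmetric and
$\psi_a$ with the eigenvalue $-1$ antisymmetric with respect to the exchange of
two particles. This is the origin of the indices "s" and "a" in (15.5). Whether
particles are described by a symmetric or an antisymmetric wave function will
depend on their nature; a transition between symmetric and antisymmetric states
is impossible. This is because the interaction between particles is symmetric
under their exchange; hence, e.g.,

$$V(\boldsymbol{r}_1, \ldots, \boldsymbol{r}_i, \ldots, \boldsymbol{r}_j, \ldots, \boldsymbol{r}_N) = V(\boldsymbol{r}_1, \ldots, \boldsymbol{r}_j, \ldots, \boldsymbol{r}_i, \ldots, \boldsymbol{r}_N) . \tag{15.6}$$

For this reason, the matrix elements between symmetric and antisymmetric
states vanish:

$$\langle \psi_s(\boldsymbol{r}_1, \boldsymbol{r}_2, \ldots, \boldsymbol{r}_N) |\, V(\boldsymbol{r}_1, \boldsymbol{r}_2, \ldots, \boldsymbol{r}_N) \,| \psi_a(\boldsymbol{r}_1, \boldsymbol{r}_2, \ldots, \boldsymbol{r}_N) \rangle = 0 , \tag{15.7}$$
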

and therefore no transitions take place between them. Both kinds of particle 
occur naturally. The particles described by an antisymmetric wave function are 
called fermions (named after E. Fermi); those particles described by a symmetric 
wave function are called bosons (named after S.N. Bose). 
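The statements (15.5) to (15.7) can be illustrated with a toy model in which each particle occupies one of a few discrete positions. This is a minimal sketch under stated assumptions: the two-particle wave function is stored as a dictionary over index pairs, the amplitudes are real (so no complex conjugation is needed), and all function names are chosen here for illustration.

```python
def product_state(u, v):
    """Product wave function psi(i, j) = u[i] * v[j] on a discrete grid."""
    return {(i, j): u[i] * v[j] for i in range(len(u)) for j in range(len(v))}


def combine(u, v, sign):
    """u(1)v(2) + sign * v(1)u(2); sign = +1 symmetric, -1 antisymmetric."""
    direct = product_state(u, v)
    swapped = product_state(v, u)
    return {key: direct[key] + sign * swapped[key] for key in direct}


def exchange(psi):
    """Exchange operator: (P psi)(i, j) = psi(j, i)."""
    return {(i, j): psi[(j, i)] for (i, j) in psi}


def matrix_element(psi_left, potential, psi_right):
    """Sum over all configurations of psi_left * V * psi_right (real case)."""
    return sum(psi_left[key] * potential[key] * psi_right[key] for key in psi_left)


if __name__ == "__main__":
    u = [1, 2, 0]   # single-particle amplitudes (illustrative integers)
    v = [0, 1, 3]
    psi_s = combine(u, v, +1)
    psi_a = combine(u, v, -1)
    # A potential that is symmetric under particle exchange, as in (15.6).
    V = {(i, j): 1 + abs(i - j) for i in range(3) for j in range(3)}
    print(exchange(psi_s) == psi_s)
    print(exchange(psi_a) == {k: -a for k, a in psi_a.items()})
    print(matrix_element(psi_s, V, psi_a))
```

The matrix element vanishes because each term in the sum is cancelled by the term with the two indices interchanged; the same argument underlies (15.7) for continuous coordinates.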

The physical criterion that distinguishes between the two kinds of particles is 
their spin: Fermions have half-integer spin; bosons have integer spin. This relation 
between spin and symmetry properties of the wave function or —as it is also 
called — between spin and statistics, was first found empirically. Later on, when 
concerned with quantum field theory (quantum electrodynamics), we will under- 
stand why this must be so. 

Examples of fermions are electrons, protons, neutrons, neutrinos, C$^{13}$ nuclei,
etc. (all spin 1/2); examples of bosons are $\pi$ mesons (spin 0), photons (spin 1),
deuterons (spin 1), $\alpha$ particles (spin 0), oxygen nuclei (spin 0). For particles
that are composed of several elementary particles, the spin also determines the
character of the statistics, as has already been mentioned. The $\alpha$ particle, which
consists of 4 nucleons with spin 1/2, has spin zero and is a boson. We get the same
result when considering that the exchange of two $\alpha$ particles amounts to exchanging
their two protons and their two neutrons pairwise; the minus signs that result from
these fermion exchanges compensate for each other in this case.
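The sign argument for composite particles can be sketched in a few lines of Python. This is only an illustration: the function names are hypothetical, and only the nucleons of a bare nucleus are counted (each one a spin-1/2 fermion).

```python
def exchange_sign(n_fermions: int) -> int:
    """Sign picked up when two identical composites are exchanged.

    Every exchanged pair of constituent fermions contributes a factor -1,
    so the total sign is (-1)**n_fermions.
    """
    return -1 if n_fermions % 2 else 1


def statistics(n_fermions: int) -> str:
    """Classify a composite particle by the parity of its fermion count."""
    return "boson" if exchange_sign(n_fermions) == 1 else "fermion"


# Nucleon numbers (protons + neutrons) of some nuclei named in the text.
nuclei = {"deuteron": 2, "alpha particle": 4, "C-13 nucleus": 13}
for name, n_nucleons in nuclei.items():
    print(f"{name}: {n_nucleons} nucleons -> {statistics(n_nucleons)}")
```

An even number of constituent fermions gives a symmetric wave function, an odd number an antisymmetric one, in agreement with the spin assignments listed above.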


15.1 The Pauli Principle 


The antisymmetry of the fermion wave function is equivalent to Pauli’s exclu- 
sion principle, empirically formulated by Wolfgang Pauli in 1925 when he was 
investigating atomic spectra. It states that there can be only one electron 
in a particular quantum-mechanical state. This simple formulation of Pauli’s 
principle, however, has to be specified in somewhat more detail. We have just
stressed that in a system of electrons only the state of the whole system and not 
that of the single particles is defined. Accordingly, the state of an electron in an 
atom will certainly change if another electron is put into the electron shell or if 
the atom is ionized. 

We will be able to avoid these difficulties if we refer to the measuring process 
used on an electron. Taking into account the degree of freedom resulting from 
the spin, the electron has four degrees of freedom. Therefore, its state is charac- 
terized by four independent numbers. As usual, the appropriate quantities to be 
chosen are energy, angular momentum, the z component of the angular mo- 
mentum and the z component of the spin. This set of quantities corresponds to 
the quantum numbers $n$, $l$, $m_l$, $m_s$. The choice of another set of quantities, e.g. the
three momentum components and the spin component, is also possible. Accord- 
ing to the choice we make, the wave function is then determined by four 
quantum numbers: 


$$\psi = \psi_{n\,l\,m_l\,m_s} \,. \tag{15.8}$$


Now we can formulate the Pauli principle in a more precise form: In a system of
electrons, the measurement of four quantities that are typical of the electron
(e.g. the quantum numbers $n$, $l$, $m_l$, $m_s$) can have a well-defined (fixed) value for
one electron only at any one moment. Two electrons can never simultaneously
occupy the state (15.8). We will soon see that the Pauli principle in this form
is a consequence of the antisymmetry of the wave function described
in (15.5).

Since this empirically ascertained principle is a consequence of the anti- 
symmetry of the fermion wave function, the Pauli principle is not only valid for 
electrons, but for all fermions. 
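As a small illustration of the counting implied by the Pauli principle, the following Python sketch enumerates the quantum-number sets $(n, l, m_l, m_s)$ of one shell. It assumes the usual hydrogen-like ranges $0 \le l < n$, $-l \le m_l \le l$, $m_s = \pm 1/2$, which are not restated in this section; the function name is hypothetical.

```python
def shell_states(n):
    """All quantum-number sets (n, l, m_l, m_s) of the shell with principal number n.

    Assumed ranges: 0 <= l < n, -l <= m_l <= l, m_s = -1/2 or +1/2.
    """
    return [(n, l, m_l, m_s)
            for l in range(n)
            for m_l in range(-l, l + 1)
            for m_s in (-0.5, 0.5)]


for n in (1, 2, 3):
    states = shell_states(n)
    # Each set of four quantum numbers appears exactly once in the list.
    assert len(states) == len(set(states))
    print(f"n = {n}: at most {len(states)} electrons")
```

Since each set of four quantum numbers can be taken by one electron only, the length of the list is the maximum number of electrons in the shell, which is $2n^2$.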


15.2 Exchange Degeneracy 


We consider a system of N identical particles without any interaction; the 
inclusion of interactions would not change any of the following fundamental 
considerations. The Schrödinger equation for such a system is

$$(\hat{H}_1 + \hat{H}_2 + \cdots + \hat{H}_N)\,\psi(r_1 s_1, r_2 s_2, \ldots, r_N s_N) = E\,\psi(r_1 s_1, r_2 s_2, \ldots, r_N s_N) \,. \tag{15.9}$$

The single-particle Hamiltonians $\hat{H}_k(r_k, s_k)$ can be distinguished from each other
by the fact that they act on different particles. If we designate the $i$th eigenfunction
of the particle $k$ by $\varphi_i(r_k, s_k)$, we will have for the eigenvalue problem of the
single particle

$$\hat{H}_k(r_k, s_k)\,\varphi_i(r_k, s_k) = E_i\,\varphi_i(r_k, s_k) \,, \quad i = 1, 2, \ldots \tag{15.10}$$


The Schrödinger equation (15.9) is then solved by the product of single-particle
wave functions:

$$\psi(r_1 s_1, r_2 s_2, \ldots, r_N s_N) = \varphi_{i_1}(r_1, s_1)\,\varphi_{i_2}(r_2, s_2) \cdots \varphi_{i_N}(r_N, s_N) \,. \tag{15.11}$$


The $i_j$ ($j = 1, 2, \ldots$) are special numbers that characterize the eigenfunctions. If
there are $n_i$ particles in the state $\varphi_i$, then we will get for the eigenvalue of the
total energy

$$E = \sum_i n_i E_i \,. \tag{15.12}$$


Because of the indistinguishability of the particles, we are not able to say which
particle is in which state. This means that there are $N!/(n_1!\,n_2!\,n_3!\cdots)$ combinations
of single-particle wave functions in (15.11) that give the same energy
eigenvalue $E$. This is called exchange degeneracy.
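The degree of exchange degeneracy can be checked by brute force. The sketch below, with hypothetical function names and arbitrary state labels, compares the formula $N!/(n_1!\,n_2!\cdots)$ with a direct count of the distinct ways of distributing the particles over the occupied states.

```python
from collections import Counter
from itertools import permutations
from math import factorial


def exchange_degeneracy(occupied):
    """N!/(n_1! n_2! ...) for a list holding one state label per particle."""
    result = factorial(len(occupied))
    for n_i in Counter(occupied).values():
        result //= factorial(n_i)
    return result


def count_distinct_assignments(occupied):
    """Count the distinct product functions obtained by permuting the particles."""
    return len(set(permutations(occupied)))


# Four particles: two in state "a", one in "b", one in "c".
labels = ["a", "a", "b", "c"]
print(exchange_degeneracy(labels), count_distinct_assignments(labels))
```

Both numbers agree because permutations that only exchange particles within the same single-particle state do not produce a new product function.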

Exchange degeneracy is lifted by the requirement of wave-function symmetry
for the bosons and fermions. In fact, the entire space of functions spanned
by the eigenfunctions to the energy $E$ contains only one symmetric and one
antisymmetric wave function. The symmetric wave function for bosons results
from the sum of all possible $N!$ permutations of the arguments of the
single-particle wave functions in (15.11). If we designate the permutations by $P$, then
the wave function of a system of bosons together with its normalization factor
$(N!\,n_1!\,n_2!\cdots)^{-1/2}$ reads:

$$\psi_{\text{boson}} = \frac{1}{\sqrt{N!\,n_1!\,n_2!\cdots}} \sum_{P} P\,\varphi_{i_1}(r_1, s_1)\,\varphi_{i_2}(r_2, s_2) \cdots \varphi_{i_N}(r_N, s_N) \,. \tag{15.13}$$


Here, we have assumed that the single-particle wave functions are orthonor- 
malized. 
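The normalization factor of the symmetrized wave function can be verified in a small model. The following sketch is an illustration under stated assumptions: the single-particle states are orthonormal, so that a product state is represented only by the ordered tuple of its state labels, and the function name is hypothetical.

```python
from collections import Counter, defaultdict
from itertools import permutations
from math import factorial, prod, sqrt


def boson_state(occupied):
    """Amplitudes of the symmetrized state in the basis of ordered product states.

    occupied holds one single-particle label per particle; all N! permutations
    are summed and divided by sqrt(N! n_1! n_2! ...).
    """
    n = len(occupied)
    counts = Counter(occupied).values()
    norm = sqrt(factorial(n) * prod(factorial(c) for c in counts))
    amplitudes = defaultdict(float)
    for perm in permutations(range(n)):
        amplitudes[tuple(occupied[p] for p in perm)] += 1.0 / norm
    return dict(amplitudes)


# Three bosons, two of them in the same state "a".
state = boson_state(["a", "a", "b"])
norm_squared = sum(c * c for c in state.values())
print(state)
print(norm_squared)
```

Every distinct product function enters with the same amplitude, and the squared amplitudes add up to one (within rounding), which is what the factor $(N!\,n_1!\,n_2!\cdots)^{-1/2}$ is meant to achieve.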


15.3 The Slater Determinant 


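The antisymmetric wave function is most conveniently expressed
in the form of a determinant. Slater's determinant is an $N \times N$ determinant
built from the single-particle wave functions (15.10), arranged in the following
way:

$$\psi_{\text{fermion}} = \frac{1}{\sqrt{N!}}
\begin{vmatrix}
\varphi_{i_1}(r_1, s_1) & \varphi_{i_2}(r_1, s_1) & \cdots & \varphi_{i_N}(r_1, s_1) \\
\varphi_{i_1}(r_2, s_2) & \varphi_{i_2}(r_2, s_2) & \cdots & \varphi_{i_N}(r_2, s_2) \\
\vdots & \vdots & & \vdots \\
\varphi_{i_1}(r_N, s_N) & \varphi_{i_2}(r_N, s_N) & \cdots & \varphi_{i_N}(r_N, s_N)
\end{vmatrix} \,. \tag{15.14}$$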


Note that each column always contains the same single-particle wave function, 
while each row contains the same argument in the single-particle wave function. 

The determinant form ensures the required properties of the fermion
wave function in an elegant way. By interchanging two particles (two rows), the
sign changes. The function will vanish if two particles occupy the same state
(two columns are equal). This is Pauli's principle! It is thus a consequence of the
antisymmetry of the wave function, an indeed most interesting and fundamental
result.
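Both determinant properties can be tried out numerically. The sketch below evaluates a Slater determinant by summing over all permutations; the three one-dimensional orbitals and the coordinate values are hypothetical and chosen only for illustration (spin is left out).

```python
from itertools import permutations
from math import exp, factorial, sqrt


def permutation_sign(perm):
    """+1 for an even permutation, -1 for an odd one (counted by inversions)."""
    sign = 1
    for i in range(len(perm)):
        for j in range(i + 1, len(perm)):
            if perm[i] > perm[j]:
                sign = -sign
    return sign


def slater(orbitals, coords):
    """Slater determinant: row k holds all orbitals evaluated at coords[k]."""
    n = len(orbitals)
    total = 0.0
    for perm in permutations(range(n)):
        term = permutation_sign(perm)
        for row, col in enumerate(perm):
            term *= orbitals[col](coords[row])
        total += term
    return total / sqrt(factorial(n))


# Hypothetical single-particle functions, not taken from the text.
orbitals = [
    lambda x: exp(-x * x / 2),
    lambda x: x * exp(-x * x / 2),
    lambda x: (2 * x * x - 1) * exp(-x * x / 2),
]
x = [0.3, -1.1, 0.8]

psi = slater(orbitals, x)
psi_swapped = slater(orbitals, [x[1], x[0], x[2]])          # exchange particles 1 and 2
psi_same_state = slater([orbitals[0], orbitals[0], orbitals[2]], x)
print(psi, psi_swapped, psi_same_state)
```

Exchanging two coordinates reverses the sign of the value, and putting two particles into the same orbital makes the determinant vanish up to rounding.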


We shall now illustrate these features in the following example. 


EXAMPLE


15.1 The Helium Atom 


Some phenomena occurring in many-body problems may be demonstrated 
by the helium atom. It consists of the He nucleus surrounded by two orbiting 
electrons. For the mathematical description of the helium atom, we start with 
the independent motion of two electrons in the Coulomb field and treat their 
mutual electrostatic interaction as a perturbation. The Schrödinger equation of the
system is

$$\hat{H}\psi = \bigl(\hat{H}(1) + \hat{H}(2) + W(1,2)\bigr)\,\psi = E\psi \,. \tag{1}$$


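The Hamiltonians $\hat{H}(1)$ and $\hat{H}(2)$ are the operators for the single-particle
problem, i.e. the individual electrons. They are

$$\hat{H}(1) = -\frac{\hbar^2}{2m}\Delta_1 + V(r_1) \quad\text{and}\quad \hat{H}(2) = -\frac{\hbar^2}{2m}\Delta_2 + V(r_2) \,, \tag{2}$$

with $V(r) = -Ze^2/r$, $Z = 2$.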

The solutions are obviously obtained from the hydrogen wave functions if
$Z = 1$ is replaced by $Z = 2$. Hence,

$$\hat{H}(1)\,\psi_r(r_1) = E_r\,\psi_r(r_1) \quad\text{and}\quad \hat{H}(2)\,\psi_s(r_2) = E_s\,\psi_s(r_2) \,. \tag{3}$$


The indices r,s represent the set of quantum numbers n, l,m. The degeneracy 
of the hydrogen wave function will not be taken into account to prevent the 
problem from becoming too complicated. The two-electron Schrödinger equation
without interactions now reads

$$\hat{H}_0\psi = \bigl(\hat{H}(1) + \hat{H}(2)\bigr)\,\psi = E\psi \,. \tag{4}$$


According to the separation of the problem expressed in (2) and (3), we immedi- 
ately get the product wave function 


$$\psi(r_1, r_2) = \psi_r(r_1)\,\psi_s(r_2) \,, \tag{5}$$

which obeys the eigenvalue equation

$$\hat{H}_0\,\psi(r_1, r_2) = (E_r + E_s)\,\psi(r_1, r_2) \,. \tag{6}$$
Obviously, not only the state $\psi$ belongs to the eigenvalue $E_r + E_s$, but also the
state

$$\psi'(r_1, r_2) = \psi_s(r_1)\,\psi_r(r_2) \,, \tag{7}$$

where the first particle is in the state $\psi_s$ and the second particle is in the state $\psi_r$.
This is the exchange degeneracy mentioned earlier. The two states $\psi$ and $\psi'$
emerge from each other by interchanging the coordinates of the particles. As
a solution of the Schrödinger equation (4), we therefore have to consider a linear
combination of the two states:

$$\psi(r_1, r_2) = a\,\psi_r(r_1)\psi_s(r_2) + b\,\psi_s(r_1)\psi_r(r_2) \,. \tag{8}$$

Because of the normalization of the states,

$$|a|^2 + |b|^2 = 1 \tag{9}$$

holds.


Now we take into account the influence of the interaction by using perturbation
theory. We start with (11.25). (Here, we have named the perturbation
energy “W” instead of “εW”.) The unperturbed energy is $E^0 = E_r + E_s$; the
expansion coefficients are $a$ and $b$. With the abbreviation

$$\varepsilon = E - E_r - E_s \,, \tag{10}$$

we get the two equations

$$(W_{11} - \varepsilon)\,a + W_{12}\,b = 0 \,, \qquad W_{21}\,a + (W_{22} - \varepsilon)\,b = 0 \,, \tag{11}$$

with the secular determinant

$$D = \begin{vmatrix} W_{11} - \varepsilon & W_{12} \\ W_{21} & W_{22} - \varepsilon \end{vmatrix} \,. \tag{12}$$
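The matrix elements of the perturbation are given, together with the interaction

$$W(1,2) = \frac{e^2}{|r_1 - r_2|} = \frac{e^2}{r_{12}} \,, \tag{13}$$

by the integrals

$$W_{11} = W_{22} = e^2 \int\!\!\int \frac{|\psi_r(r_1)|^2\,|\psi_s(r_2)|^2}{r_{12}}\, dV_1\, dV_2 \tag{14}$$

and

$$W_{12} = W_{21} = e^2 \int\!\!\int \frac{\psi_r^*(r_1)\,\psi_s^*(r_2)\,\psi_s(r_1)\,\psi_r(r_2)}{r_{12}}\, dV_1\, dV_2 \,. \tag{15}$$

Usually the matrix elements are denoted by the letters

$$W_{11} = W_{22} = K \quad\text{and}\quad W_{12} = W_{21} = A \,. \tag{16}$$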




The quantity $K$ is the Coulomb interaction of the two charge densities $e|\psi_r(r_1)|^2$
and $e|\psi_s(r_2)|^2$. The quantity $A$ is called the exchange energy; it has no classical
analogue. The exchange integral is due to the fact that an electron may be in
the state $\psi_r$ as well as in the state $\psi_s$. The magnitude of the exchange integral
depends on the product $\psi_r\psi_s$, i.e. on the overlapping of the two wave functions.
Thus, for example, the exchange energy between the ground state and a highly
excited state is very small.

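From the requirement for a nontrivial solution of system (11), it follows that
the secular determinant has to vanish:

$$D = 0 \,, \tag{17}$$

so that

$$(K - \varepsilon)^2 = A^2 \tag{18}$$

has to hold. Therefore we get for the energy splitting by the perturbation

$$\varepsilon = K \pm A \,. \tag{19}$$

For $\varepsilon = K + A$, (9) and (11) yield

$$a = b = \frac{1}{\sqrt{2}} \,, \tag{20}$$

and for $\varepsilon = K - A$, analogously

$$a = -b = \frac{1}{\sqrt{2}} \,. \tag{21}$$

The exchange degeneracy is broken by the interaction; the state splits up into
a symmetric and an antisymmetric state (on the left-hand sides, the indices s and a
stand for "symmetric" and "antisymmetric"):

$$\psi_s(r_1, r_2) = \frac{1}{\sqrt{2}}\bigl(\psi_r(r_1)\psi_s(r_2) + \psi_s(r_1)\psi_r(r_2)\bigr) \quad\text{and}$$

$$\psi_a(r_1, r_2) = \frac{1}{\sqrt{2}}\bigl(\psi_r(r_1)\psi_s(r_2) - \psi_s(r_1)\psi_r(r_2)\bigr) \,. \tag{22}$$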
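The two-level problem with $W_{11} = W_{22} = K$ and $W_{12} = W_{21} = A$ is small enough to be checked directly. In the following sketch the function names are hypothetical and the numbers for $K$ and $A$ are arbitrary test values, not helium integrals.

```python
from math import sqrt


def two_level_splitting(K, A):
    """Energy shifts eps and normalized coefficients (a, b) of the 2x2 secular problem."""
    s = 1 / sqrt(2)
    return [(K + A, (s, s)),     # symmetric combination
            (K - A, (s, -s))]    # antisymmetric combination


def residual(K, A, eps, a, b):
    """Left-hand sides of the two linear equations for a and b."""
    return ((K - eps) * a + A * b, A * a + (K - eps) * b)


K, A = 1.3, 0.4   # arbitrary illustrative values
for eps, (a, b) in two_level_splitting(K, A):
    r1, r2 = residual(K, A, eps, a, b)
    print(f"eps = {eps:.3f}, a = {a:+.4f}, b = {b:+.4f}, residuals = {r1:.1e}, {r2:.1e}")
```

The residuals vanish up to rounding for both roots, and the coefficients satisfy the normalization condition, so the symmetric combination belongs to $K + A$ and the antisymmetric one to $K - A$.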


Until now, we have regarded the electrons as spinless particles. Since the
electrons have a spin (see Chaps. 12, 13), they are fermions and their total wave
function has to be antisymmetric. The interactions involving spin (spin-orbit,
spin-spin) are neglected; then we can write the total wave function as the product
of the space ($\psi$) and spin ($\chi$) wave functions:

$$\Psi = \psi(r_1, r_2)\,\chi \,. \tag{23}$$

Since the total wave function has to be antisymmetric, the product functions (23)
always consist of an antisymmetric and a symmetric function. Either the spatial
part $\psi$ is symmetric and the spin function $\chi$ antisymmetric, or vice versa.

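We denote the spin function of particle 1 with spin up by $\chi_1^{\uparrow}$, etc. Three
symmetric and one antisymmetric state can be constructed from the spin
functions:

$$\chi_s = \begin{cases}
\chi_1^{\uparrow}\chi_2^{\uparrow} \\[2pt]
\dfrac{1}{\sqrt{2}}\bigl(\chi_1^{\uparrow}\chi_2^{\downarrow} + \chi_1^{\downarrow}\chi_2^{\uparrow}\bigr) \\[6pt]
\chi_1^{\downarrow}\chi_2^{\downarrow}
\end{cases}
\,, \qquad
\chi_a = \frac{1}{\sqrt{2}}\bigl(\chi_1^{\uparrow}\chi_2^{\downarrow} - \chi_1^{\downarrow}\chi_2^{\uparrow}\bigr) \,. \tag{24}$$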

The factors $1/\sqrt{2}$ are necessary for normalization. The helium atoms with
symmetric spin function are called orthohelium; those with an antisymmetric
spin function are called parahelium. The properties are summarized in the
following figure. $E_1$ is the ground state energy of the hydrogen atom for $Z = 2$;
$E_2$ is the corresponding energy for the first excited state.


[Figure: Orthohelium and parahelium wave functions and energies. Columns: wave function, spin, energy. Levels shown: ground state ($2E_1$, parahelium) and 1st excited state ($E_1 + E_2$); orthohelium is marked as metastable.]


Parahelium is energetically the lowest state of helium. Its spatial wave func- 
tion is symmetric; both electrons may simultaneously occupy the ground state. 
Then the Pauli principle requires an antisymmetric spin function. For ortho- 
helium, the spin function is symmetric; the Pauli principle prevents both particles 
from simultaneously being in the ground state. Because of the small spin-spin 
interaction (interaction of the respective magnetic dipole moments) the possi- 
bility of a spin flip in orthohelium is very small; therefore orthohelium represents 
a metastable state of helium. 
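The exchange symmetry of the two-electron spin functions can be checked with a few lines of Python. The representation below (dictionaries mapping a pair of spin labels to an amplitude) and all names are assumptions made for this illustration only.

```python
from math import sqrt

UP, DOWN = "up", "down"
c = 1 / sqrt(2)

# Three symmetric (orthohelium) spin functions and one antisymmetric (parahelium) one.
triplet = [
    {(UP, UP): 1.0},
    {(UP, DOWN): c, (DOWN, UP): c},
    {(DOWN, DOWN): 1.0},
]
singlet = {(UP, DOWN): c, (DOWN, UP): -c}


def exchange(chi):
    """Interchange the spin labels of particle 1 and particle 2."""
    return {(s2, s1): amp for (s1, s2), amp in chi.items()}


def overlap(chi1, chi2):
    """Scalar product of two spin functions with real amplitudes."""
    return sum(amp * chi2.get(key, 0.0) for key, amp in chi1.items())


def exchange_eigenvalue(chi):
    """Expectation value of the exchange operation: +1 symmetric, -1 antisymmetric."""
    return overlap(chi, exchange(chi))


for chi in triplet:
    print("triplet:", exchange_eigenvalue(chi))
print("singlet:", exchange_eigenvalue(singlet))
```

The three triplet functions are even and the singlet function is odd under exchange, so the antisymmetric total wave function pairs the triplet with the antisymmetric spatial function and the singlet with the symmetric one.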
EXAMPLE


15.2 The Hydrogen Molecule 


The exchange energy is the reason for the homeopolar binding of molecules. 
To get a better understanding of this kind of binding, we examine the simplest 
example: the hydrogen molecule. Here, we use perturbation theory, as we did for 
the helium atom in the preceding example. As an approximation of zeroth order, 
we use products of the hydrogen eigenfunctions for the molecular wave function. 
We begin with two hydrogen atoms that are very far away from each other and 
regard the forces occurring during their approach as a perturbation. It is clear 
that this way of treating the problem is not very accurate, because the forces 
appearing in the molecular region of interest will not be small any more. With 
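the notations given in the following figure, the Schrödinger equation for the two
electrons in the potential of the two protons is

$$\left[-\frac{\hbar^2}{2m}(\Delta_1 + \Delta_2) + \left(\frac{e^2}{r_{12}} - \frac{e^2}{r_{a1}} - \frac{e^2}{r_{a2}} - \frac{e^2}{r_{b1}} - \frac{e^2}{r_{b2}}\right)\right]\psi = E\psi \,. \tag{1}$$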

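The distance $r_{ab}$ is used as a parameter in the following calculation. The
Hamiltonian $\hat{H}$ of equation (1) is now split up in two different ways. With the
abbreviations

$$\hat{H}_{a1} = -\frac{\hbar^2}{2m}\Delta_1 - \frac{e^2}{r_{a1}} \,, \qquad \hat{H}_{b2} = -\frac{\hbar^2}{2m}\Delta_2 - \frac{e^2}{r_{b2}} \quad\text{and}$$

$$W_{a2,b1} = -\frac{e^2}{r_{a2}} - \frac{e^2}{r_{b1}} + \frac{e^2}{r_{12}} \,, \tag{2}$$

we can write

$$\hat{H} = \hat{H}_{a1} + \hat{H}_{b2} + W_{a2,b1} \,.$$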


This decomposition corresponds to associating electron 1 to nucleus a and
electron 2 to nucleus b. Consequently, for a large distance between the two
nuclei, the $W_{a2,b1}$ part of the Hamiltonian, which is treated as a perturbation
in our approximation, vanishes. With this association of electrons to the nuclei,
(1) becomes for $r_{ab} = \infty$

$$(\hat{H}_{a1} + \hat{H}_{b2})\,u = E\,u \,, \tag{3}$$

which is an equation that describes two noninteracting hydrogen atoms. It is
solved by the product wave function

$$u(r_1, r_2) = \psi_a(r_{a1})\,\psi_b(r_{b2}) \,, \tag{4}$$

where $\psi$ symbolizes the hydrogen wave functions that follow from the equation

$$\hat{H}_{a1}\,\psi_a = E_0\,\psi_a \tag{5}$$
and the corresponding equation for $\psi_b$. Here, we set $E = 2E_0$, since we
assume that both hydrogen atoms are in the ground state with energy $E_0$.

Immediately, we see that we have an exchange degeneracy for this problem,
since associating electron 2 with nucleus a, and electron 1 with nucleus b leads
to equations that differ from the former ones only in the indices. Instead of the
operators (2) we then have

$$\hat{H}_{a2} = -\frac{\hbar^2}{2m}\Delta_2 - \frac{e^2}{r_{a2}} \,, \qquad \hat{H}_{b1} = -\frac{\hbar^2}{2m}\Delta_1 - \frac{e^2}{r_{b1}} \,,$$

$$W_{a1,b2} = -\frac{e^2}{r_{a1}} - \frac{e^2}{r_{b2}} + \frac{e^2}{r_{12}} \,. \tag{6}$$

Again, $W_{a1,b2}$ vanishes for $r_{ab} \to \infty$, and the remaining part of (1) is solved by

$$v(r_1, r_2) = \psi_b(r_{b1})\,\psi_a(r_{a2}) \,. \tag{7}$$

The hydrogen wave functions $\psi$ fulfil a correspondingly changed form of (5).
Equation (1) is solved for $r_{ab} \to \infty$ by a linear combination of the functions (4)
and (7), namely

$$au + bv = a\,\psi_a(r_{a1})\psi_b(r_{b2}) + b\,\psi_b(r_{b1})\psi_a(r_{a2}) \,. \tag{8}$$


We use this linear combination (8) of the functions $u$ and $v$ as the zeroth
approximation for the solution of our problem. Note that the functions $u$ and
$v$ are only orthogonal in the limit $r_{ab} \to \infty$. When the two nuclei come closer to
each other, the electron wave functions overlap and the integral

$$|S|^2 = \int\!\!\int u^* v\, dV_1\, dV_2 = \underbrace{\int \psi_a^*(r_{a1})\,\psi_b(r_{b1})\, dV_1}_{S}\;\underbrace{\int \psi_a(r_{a2})\,\psi_b^*(r_{b2})\, dV_2}_{S^*} \tag{9}$$

will be nonzero.
As a result of the perturbation $W$, the energy of the system changes, as does
the wave function. We write

$$E = 2E_0 + \varepsilon \quad\text{and}\quad \psi = au + bv + \varphi \,. \tag{10}$$

The various terms $W_{a1,b2}$, $W_{a2,b1}$, $\varepsilon$ and $\varphi$ are assumed to be small; products of
these quantities are neglected in the following. Now we insert (10) into (1) and
get, by neglecting products which are small in second order,

$$a(\hat{H}_{a1} + \hat{H}_{b2} + W_{a2,b1})\,u + b(\hat{H}_{a2} + \hat{H}_{b1} + W_{a1,b2})\,v + (\hat{H}_{a1} + \hat{H}_{b2})\,\varphi
= 2E_0(au + bv) + \varepsilon(au + bv) + 2E_0\varphi \,. \tag{11}$$

The parts of the unperturbed system cancel and after reordering according to
the different functions, we have

$$a(W_{a2,b1} - \varepsilon)\,u + b(W_{a1,b2} - \varepsilon)\,v + (\hat{H}_{a1} + \hat{H}_{b2} - 2E_0)\,\varphi = 0 \,. \tag{12}$$


For $a = b = 0$, (12) is a homogeneous differential equation for $\varphi$ with the solution
$\varphi = u$, as a comparison with (3) shows. We use the theorem that the solution of
a homogeneous differential equation is orthogonal to the inhomogeneous part of
the differential equation. Hence,

$$\int\!\!\int \bigl[a(W_{a2,b1} - \varepsilon)\,u + b(W_{a1,b2} - \varepsilon)\,v\bigr]^*\, u\; dV_1\, dV_2 = 0 \,. \tag{13}$$


In formulating (11), we have expressed the Hamiltonian $\hat{H}$ that acts on $\varphi$ via the
decomposition (2). If we use the decomposition (6) as the next step, we will get
instead of (12) the differential equation

$$a(W_{a2,b1} - \varepsilon)\,u + b(W_{a1,b2} - \varepsilon)\,v + (\hat{H}_{a2} + \hat{H}_{b1} - 2E_0)\,\varphi = 0 \,, \tag{14}$$

which is solved by the function $\varphi = v$ for the homogeneous part. Then the same
argument as above in (13) leads to the integral

$$\int\!\!\int \bigl[a(W_{a2,b1} - \varepsilon)\,u + b(W_{a1,b2} - \varepsilon)\,v\bigr]^*\, v\; dV_1\, dV_2 = 0 \,. \tag{15}$$
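It holds that

$$\int\!\!\int |u|^2\, W_{a2,b1}\, dV_1\, dV_2 = \int\!\!\int |v|^2\, W_{a1,b2}\, dV_1\, dV_2 = K \tag{16}$$

and

$$\int\!\!\int u^* v\, W_{a2,b1}\, dV_1\, dV_2 = \int\!\!\int v^* u\, W_{a1,b2}\, dV_1\, dV_2 = A \,. \tag{17}$$

The equality of the integrals is due to the fact that the arguments differ by their
indices only. Here, $K$ is the Coulomb energy of the perturbation; $A$ is the
exchange energy. If, for example, we insert the perturbation $W$ in the form (2), it
will follow that

$$K = -e^2 \int \frac{|\psi_a(r_{a1})|^2}{r_{b1}}\, dV_1 - e^2 \int \frac{|\psi_b(r_{b2})|^2}{r_{a2}}\, dV_2
+ e^2 \int\!\!\int \frac{|\psi_a(r_{a1})|^2\,|\psi_b(r_{b2})|^2}{r_{12}}\, dV_1\, dV_2 \tag{18}$$

and

$$A = -e^2 \int \frac{\psi_a^*(r_{a1})\,\psi_b(r_{b1})}{r_{b1}}\, dV_1 \times S^*
- e^2 \int \frac{\psi_a(r_{a2})\,\psi_b^*(r_{b2})}{r_{a2}}\, dV_2 \times S
+ e^2 \int\!\!\int \frac{\psi_a^*(r_{a1})\,\psi_b^*(r_{b2})\,\psi_a(r_{a2})\,\psi_b(r_{b1})}{r_{12}}\, dV_1\, dV_2 \,.$$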


In the case of the Coulomb energy, the various terms express the energy of the 
interaction of the various electron charge distributions with the other nucleus 
and the mutual interaction of the two electron charge distributions. 

In the case of the exchange energy, the mixed densities appear. The quantity
$S$, defined in (9), expresses the overlap of the nonorthogonal electron wave
functions. We have

$$S(r_{ab} \to \infty) = 0 \quad\text{and}\quad S(r_{ab} \to 0) = 1 \,.$$


By using the abbreviations introduced for the different integrals, we can write
(13) and (15) in compact form:

$$(\varepsilon - K)\,a + (\varepsilon S^2 - A)\,b = 0 \,, \tag{19}$$

$$(\varepsilon S^2 - A)\,a + (\varepsilon - K)\,b = 0 \,. \tag{20}$$

[Figure: The different energy contributions to the binding of the hydrogen molecule]


Thus we have two equations for determining the coefficients of the linear 
combination (8). Assuming a nontrivial solution for the system of equations, its 
determinant has to vanish, yielding the relation 


$$(\varepsilon - K)^2 = (\varepsilon S^2 - A)^2 \,.$$


The solutions of this equation give the energy shifts

$$\varepsilon_1 = \frac{K - A}{1 - S^2} \,, \tag{21}$$

$$\varepsilon_2 = \frac{K + A}{1 + S^2} \,. \tag{22}$$

The insertion of these two solutions into (19) and (20) yields for the coefficients

$$a = -b \quad\text{for } \varepsilon_1 \text{ and} \tag{23}$$

$$a = b \quad\text{for } \varepsilon_2 \,. \tag{24}$$

Therefore we get a symmetric solution, with the energy

$$E_s = 2E_0 + \frac{K + A}{1 + S^2} \,, \tag{25}$$

and an antisymmetric solution, with

$$E_a = 2E_0 + \frac{K - A}{1 - S^2} \,. \tag{26}$$
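The two energy shifts can be verified against the determinant condition $(\varepsilon - K)^2 = (\varepsilon S^2 - A)^2$ with a short sketch. The function name is hypothetical, and the values of $K$, $A$ and $S^2$ are arbitrary test numbers, not the actual hydrogen integrals.

```python
def heitler_london_shifts(K, A, S2):
    """Energy shifts of the symmetric and antisymmetric solution.

    K: Coulomb integral, A: exchange integral, S2: squared overlap (0 <= S2 < 1).
    """
    eps_sym = (K + A) / (1 + S2)    # coefficients a = b
    eps_anti = (K - A) / (1 - S2)   # coefficients a = -b
    return eps_sym, eps_anti


def determinant_condition(eps, K, A, S2):
    """Difference of the two sides of (eps - K)^2 = (eps*S2 - A)^2."""
    return (eps - K) ** 2 - (eps * S2 - A) ** 2


K, A, S2 = -0.05, -0.30, 0.5   # arbitrary illustrative values
eps_sym, eps_anti = heitler_london_shifts(K, A, S2)
print(eps_sym, determinant_condition(eps_sym, K, A, S2))
print(eps_anti, determinant_condition(eps_anti, K, A, S2))
```

With a negative exchange integral, as stated in the text for the hydrogen molecule, the symmetric solution is lowered and the antisymmetric one is raised in this test case.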


To ascertain the energies, we have to calculate the integrals K, A and S with the 
wave function of the ground state of the hydrogen atom. Owing to the extensive 
calculation necessary for this task, we give only a graphical representation here. 
We treat the protons as classical point particles. Then the energy 
$$\varepsilon + \frac{e^2}{r_{ab}} \tag{27}$$

is the binding energy of the molecule. In the following figure, the Coulomb energy
$K + e^2/r_{ab}$ is given as a function of the distance of the nuclei (in units of Bohr's
radius). The result is a very weak binding. The exchange energy is negative and,
except for the case of very small distances, it is greater than the Coulomb energy. 
This causes a stronger binding in the symmetric state (22), and repulsion in the 
antisymmetric case (21). Therefore the sign of the exchange energy is responsible 
for the binding of the H$_2$ molecule. The real binding energy of the hydrogen
molecule is much smaller than the value of this calculation (—4.4 eV). In spite 
of this quantitative failing, the calculation gives an idea of how homeopolar 
binding comes about. 

Thus we have found a symmetric spatial wave function for the ground state of the
hydrogen molecule. Because of the Pauli principle, the spin function has to be
antisymmetric, i.e. the electron spins are oriented in an antiparallel manner [see (24) of
Example 15.1]. We distinguish, as with helium, between parahydrogen (singlet
state) and metastable orthohydrogen (triplet state).

Our presentation of a solution to the problem by perturbation theory is 
based on considerations by W. Heitler and F. London. More precise methods 
first solve the two-centre problem for the electrons, i.e. the Schrödinger equation
with the Hamiltonian (2) without the electron-electron interaction $e^2/r_{12}$. In
1930, E. Teller and E.A. Hylleraas applied this new method. 

Recently this kind of problem has again attracted attention, particularly in 
connection with the creation of very heavy quasi-molecules in the collision of 
very heavy ions. Since this scenario is a two-centre problem of very heavy nuclei 
and one electron (e.g. uranium-uranium molecule) with small distances between 
the nuclei, the two-centre Dirac equation¹ has to be solved, because the inner
electrons in heavy and very heavy elements are relativistic. We shall treat this 
topic in more detail when discussing relativistic quantum theory. 


EXAMPLE


15.3 The van der Waals Interaction 


As an example of an application of the variational method (see Chap. 11)
we calculate the long-range (van der Waals) interaction between two hydrogen
atoms in their ground states. To this end, it is useful initially to treat this
problem using perturbation theory, because afterwards it is easier to see that the
leading term of the interaction energy is inversely proportional to the sixth
power ($\sim 1/R^6$) of the distance $R$ between the two atoms. It will also become
apparent that perturbation theory and the variational calculation represent
opposite limits for the determination of the coefficient of the $1/R^6$ term.

The two atomic nuclei A and B of the hydrogen atoms are separated by
a distance $R$ (see figure) and the z axis is given by the connecting line between
A and B. We denote the position vector of electron 1 relative to nucleus A by $r_1$, and
the position vector of electron 2 relative to nucleus B by $r_2$.

[Figure: nuclei A and B separated by the distance R along the z axis]


1 See B. Müller, W. Greiner: Z. Naturforsch. 31A, 1 (1976). An extensive discussion
of this exciting physics can be found in: W. Greiner, B. Müller, J. Rafelski: Quantum
Electrodynamics of Strong Fields (Springer, Berlin, Heidelberg, New York, Tokyo 1985).

The Hamiltonian for the two electrons reads (with spin-orbit coupling
neglected) 


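$$\hat{H} = \hat{H}_0 + \hat{H}' \quad\text{with}$$

$$\hat{H}_0 = -\frac{\hbar^2}{2m}(\nabla_1^2 + \nabla_2^2) - \frac{e^2}{r_1} - \frac{e^2}{r_2} \,,$$

$$\hat{H}' = \frac{e^2}{R} + \frac{e^2}{r_{12}} - \frac{e^2}{r_{1B}} - \frac{e^2}{r_{2A}} \,, \tag{1}$$

(here $r_{12}$ is the distance between the two electrons, $r_{1B}$ the distance of electron 1
from nucleus B, and $r_{2A}$ the distance of electron 2 from nucleus A)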


where, in the ground state, the unperturbed Hamiltonian $\hat{H}_0$ has the stationary
solution

$$\psi_0(r_1, r_2) = \psi_{100}(r_1)\,\psi_{100}(r_2) \,. \tag{2}$$

Here $\psi_{nlm}(r)$ denotes the known hydrogen eigenfunctions [see (9.45)]. We regard
$\hat{H}'$ as a perturbation, which is approximately valid for a large distance
between the two atoms, $R \gg a_0$, $a_0$ being Bohr's radius.

Since we are interested in the leading term of the interaction energy, we 
expand H’ according to powers of 1/R and pay attention only to the terms of 
lowest order: 


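$$\hat{H}' = \frac{e^2}{R}\left\{ 1 + \left[ 1 + \frac{2(z_2 - z_1)}{R} + \frac{(x_2 - x_1)^2 + (y_2 - y_1)^2 + (z_2 - z_1)^2}{R^2} \right]^{-1/2}
- \left( 1 - \frac{2z_1}{R} + \frac{r_1^2}{R^2} \right)^{-1/2} - \left( 1 + \frac{2z_2}{R} + \frac{r_2^2}{R^2} \right)^{-1/2} \right\}
\approx \frac{e^2}{R^3}\,(x_1x_2 + y_1y_2 - 2z_1z_2) \,. \tag{3}$$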


The leading term apparently describes the interaction energy of two electric 
dipoles, which are given by the momentary configuration of the two atoms. The 
neglected terms of the order $1/R^4$ and $1/R^5$ correspond to the instantaneous
dipole-quadrupole interaction and the quadrupole-quadrupole interaction, 
respectively. 

Now, it is clear that the expectation value of the leading term of $\hat{H}'$ (3) in the
state $\psi_0(r_1, r_2)$ (2) vanishes, because $\psi_0$ is an even function of $r_1$ and $r_2$, while $\hat{H}'$,
on the contrary, is an odd function of $r_1$ and $r_2$. One can also show that all
neglected terms of higher order in $\hat{H}'$ have a vanishing expectation value in the
state $\psi_0(r_1, r_2)$, too, because these terms may be expressed by spherical
harmonics $Y_{lm}$ with $l \neq 0$. Therefore the leading term of the interaction energy of the
two atoms has to be of second order in perturbation theory when the dipole part
[see (3)] is taken into account, and so it has to depend on the distance like $1/R^6$.
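The dipole-dipole form of the perturbation can be tested numerically by comparing it with the full electrostatic expression at increasing separation. The sketch below is only an illustration: the geometry (nucleus A at the origin, nucleus B at distance $R$ on the z axis), the choice $e^2 = 1$, the function names and the electron positions are assumptions made for this check.

```python
from math import sqrt


def h_prime_exact(r1, r2, R, e2=1.0):
    """e^2 (1/R + 1/r12 - 1/r1B - 1/r2A) for electron 1 at r1 (relative to A)
    and electron 2 at r2 (relative to B); B sits at (0, 0, R)."""
    x1, y1, z1 = r1
    x2, y2, z2 = r2
    r12 = sqrt((x2 - x1) ** 2 + (y2 - y1) ** 2 + (R + z2 - z1) ** 2)
    r1B = sqrt(x1 ** 2 + y1 ** 2 + (z1 - R) ** 2)
    r2A = sqrt(x2 ** 2 + y2 ** 2 + (z2 + R) ** 2)
    return e2 * (1 / R + 1 / r12 - 1 / r1B - 1 / r2A)


def h_prime_dipole(r1, r2, R, e2=1.0):
    """Leading term e^2 (x1 x2 + y1 y2 - 2 z1 z2) / R^3."""
    x1, y1, z1 = r1
    x2, y2, z2 = r2
    return e2 / R ** 3 * (x1 * x2 + y1 * y2 - 2 * z1 * z2)


# Arbitrary electron positions of the order of one length unit.
r1 = (0.7, -0.4, 0.5)
r2 = (-0.3, 0.9, 0.6)
for R in (10.0, 100.0, 1000.0):
    ratio = h_prime_exact(r1, r2, R) / h_prime_dipole(r1, r2, R)
    print(f"R = {R:7.1f}: exact / dipole = {ratio:.5f}")
```

The ratio approaches one as $R$ grows, since the neglected dipole-quadrupole and quadrupole-quadrupole terms fall off with higher powers of $1/R$.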


Perturbation Theory. In second-order perturbation theory, the interaction
energy of the two hydrogen atoms is

$$W(R) = -\sum_{n \neq 0} \frac{|\langle n|\hat{H}'|0\rangle|^2}{E_n - E_0} \,. \tag{4}$$

Here, the index $n$ stands for all states of the pair of electrons of both hydrogen
atoms, including the dissociated states. Because of (4), it is clear that $W(R)$ is
negative, since $E_n > E_0$ and therefore numerator and denominator are always
positive. Hence, we may conclude that the interaction of the two hydrogen
atoms is attractive and proportional to $1/R^6$ if $R$ is large (i.e. $R \gg a_0$). We realize
that this conclusion is valid for any pair of atoms which have nondegenerate
spherically symmetric ground states; this is generally the case.

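We can get an upper bound for the positive quantity $-W(R)$ as follows. Let
us replace all $E_n$ in the denominators by the lowest value $E_1$. Here, $E_1$ is the energy
of the first excited state (2p state) of the hydrogen atoms. Then all denominators in the sum
(4) are equal, and the summation is performed in the following way:

$$\begin{aligned}
-W(R) &= \sum_{n \neq 0} \frac{|\langle n|\hat{H}'|0\rangle|^2}{E_n - E_0} \le \sum_{n \neq 0} \frac{|\langle n|\hat{H}'|0\rangle|^2}{E_1 - E_0} \\
&= \frac{1}{E_1 - E_0} \sum_{n \neq 0} \langle 0|\hat{H}'|n\rangle \langle n|\hat{H}'|0\rangle \\
&= \frac{1}{E_1 - E_0} \Bigl( \sum_{n} \langle 0|\hat{H}'|n\rangle \langle n|\hat{H}'|0\rangle - |\langle 0|\hat{H}'|0\rangle|^2 \Bigr) \\
&= \frac{1}{E_1 - E_0} \bigl( \langle 0|\hat{H}'^2|0\rangle - \langle 0|\hat{H}'|0\rangle^2 \bigr) \\
&= \frac{\langle 0|\hat{H}'^2|0\rangle}{E_1 - E_0} \,.
\end{aligned} \tag{5}$$

Here, we have used the completeness relation and the well-known fact that
$\langle 0|\hat{H}'|0\rangle = 0$. Thus

$$W(R) \ge -\frac{\langle 0|\hat{H}'^2|0\rangle}{E_1 - E_0} \,. \tag{6}$$

Now, we have (see Chap. 9)

$$E_0 = -2\left(\frac{e^2}{2a_0}\right) \quad\text{(both atoms in the ground state)} \,,$$

$$E_1 = -2\left(\frac{e^2}{8a_0}\right) \quad\text{(both atoms in the first excited state)} \,.$$

Consequently,

$$E_1 - E_0 = \frac{3e^2}{4a_0} \,. \tag{7}$$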

Furthermore, it follows from (3) that

$$\hat{H}'^2 = \frac{e^4}{R^6}\,(x_1^2x_2^2 + y_1^2y_2^2 + 4z_1^2z_2^2 + 2x_1x_2y_1y_2 - \cdots) \,. \tag{8}$$

The expectation value of the mixed terms (like $2x_1x_2y_1y_2$) vanishes for the same
reason as in the former discussion (an odd function of the components of $r_1$
and/or $r_2$). In addition, each of the first three terms of (8) yields a product of
identical factors. For instance,

$$\int x^2\, |\psi_{100}(r)|^2\, d^3r = \frac{1}{3}\int r^2\, |\psi_{100}(r)|^2\, d^3r
= \frac{1}{3\pi a_0^3} \int_0^\infty r^2\, e^{-2r/a_0}\, 4\pi r^2\, dr = a_0^2 \,. \tag{9}$$

Then

$$\langle 0|\hat{H}'^2|0\rangle = \frac{6e^4a_0^4}{R^6} \,.$$

With this equation, (6) reads

$$W(R) \ge -\frac{8e^2a_0^5}{R^6} \,. \tag{10}$$
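
The radial integral and the resulting coefficient of the lower bound can be checked numerically. The sketch below works in units with $a_0 = 1$ and $e^2 = 1$ and uses a plain trapezoidal rule; the function name, the cutoff radius and the step count are arbitrary choices for this illustration.

```python
from math import exp, pi


def mean_x_squared(a0=1.0, r_max_in_a0=40.0, steps=40000):
    """<x^2> = (1/3) * integral of r^2 |psi_100|^2 4 pi r^2 dr, by the trapezoidal rule."""
    h = r_max_in_a0 * a0 / steps
    total = 0.0
    for i in range(steps + 1):
        r = i * h
        density = exp(-2 * r / a0) / (pi * a0 ** 3)      # |psi_100(r)|^2
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * (r * r / 3) * density * 4 * pi * r * r
    return total * h


x2 = mean_x_squared()
# <0|H'^2|0> R^6 / e^4 = <x^2>^2 + <y^2>^2 + 4 <z^2>^2 in units of a0^4
h2_coefficient = (1 + 1 + 4) * x2 ** 2
# E_1 - E_0 = 3/4 in units of e^2 / a0
lower_bound_coefficient = h2_coefficient / 0.75
print(x2, h2_coefficient, lower_bound_coefficient)
```

Within the accuracy of the quadrature, the three numbers reproduce $a_0^2$, the factor 6 in the expectation value of $\hat{H}'^2$, and the factor 8 in the lower bound.
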
Variational Method. Eq. (10) represents a lower bound for $W(R)$. An upper
bound may always be computed by a variational procedure. We have to
consider the problem of choosing a reasonable test wave function $\psi$. If we
choose $\psi$ to be independent of $R$, then the expectation value $\langle\psi|\hat{H}'|\psi\rangle$ is
proportional to $1/R^3$, which is not useful for our case, since we want to know
the coefficient of the $1/R^6$ term. Therefore we have to take into account the
polarization effects in the wave function. Since we assume that the polarization is
proportional to $\hat{H}'$, we write the test function in the following way:

$$\psi(r_1, r_2) = \psi_{100}(r_1)\,\psi_{100}(r_2)\,(1 + A\hat{H}') = \psi_0(r_1, r_2)\,(1 + A\hat{H}') \,, \tag{11}$$


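where $A$ represents the variational parameter. Then the variational problem
[see (11.32, 11.37)] gives

$$E_0 + W(R) \le \frac{\int\!\!\int \psi_0(r_1, r_2)\,(1 + A\hat{H}')\,(\hat{H}_0 + \hat{H}')\,\psi_0(r_1, r_2)\,(1 + A\hat{H}')\; d^3r_1\, d^3r_2}
{\int\!\!\int \psi_0^2(r_1, r_2)\,(1 + A\hat{H}')^2\; d^3r_1\, d^3r_2} \,. \tag{12}$$

$A$ is assumed to be real. The right-hand side of (12) may be rewritten as

$$\frac{E_0 + 2A\,\langle 0|\hat{H}'^2|0\rangle + A^2\,\langle 0|\hat{H}'\hat{H}_0\hat{H}'|0\rangle}{1 + A^2\,\langle 0|\hat{H}'^2|0\rangle} \,, \tag{13}$$

since $\psi_0(r_1, r_2)$ is a normalized eigenfunction of $\hat{H}_0$, with the eigenvalue
$E_0 = -e^2/a_0$, and, furthermore,

$$\langle 0|\hat{H}'|0\rangle = \langle 0|\hat{H}'^3|0\rangle = 0 \,.$$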


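The matrix element $\langle 0|\hat{H}'\hat{H}_0\hat{H}'|0\rangle$ gives a negative contribution. This can be
shown by inserting complete sets of eigenfunctions of $\hat{H}_0$:

$$\langle 0|\hat{H}'\hat{H}_0\hat{H}'|0\rangle = \sum_{n,m} \langle 0|\hat{H}'|n\rangle \langle n|\hat{H}_0|m\rangle \langle m|\hat{H}'|0\rangle
= \sum_{n} |\langle 0|\hat{H}'|n\rangle|^2\, E_n < 0 \,,$$

because all $E_n$ are negative. Thus, by

$$\frac{E_0 + 2A\,\langle 0|\hat{H}'^2|0\rangle + A^2\,\langle 0|\hat{H}'\hat{H}_0\hat{H}'|0\rangle}{1 + A^2\,\langle 0|\hat{H}'^2|0\rangle}
\le \frac{E_0 + 2A\,\langle 0|\hat{H}'^2|0\rangle}{1 + A^2\,\langle 0|\hat{H}'^2|0\rangle} \tag{14}$$

we can give an upper bound for (13).
As we are interested only in terms up to the order $\hat{H}'^2$, we expand the
denominator of (14) and obtain

$$(E_0 + 2A\,\langle 0|\hat{H}'^2|0\rangle)\,(1 + A^2\,\langle 0|\hat{H}'^2|0\rangle)^{-1} \approx E_0 + (2A - E_0A^2)\,\langle 0|\hat{H}'^2|0\rangle \,. \tag{15}$$

$E_0$, the energy of the ground state of the two hydrogen atoms, is negative.
Therefore (15) has a minimum at $A = 1/E_0$, and thus (12) takes the form

$$E_0 + W(R) \le E_0 + \frac{\langle 0|\hat{H}'^2|0\rangle}{E_0} = E_0 - \frac{6e^2a_0^5}{R^6} \,. \tag{16}$$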
Together with (10) we have both an upper and lower bound for the interaction
energy, which can be expressed by the inequality

$$-\frac{8e^2a_0^5}{R^6} \le W(R) \le -\frac{6e^2a_0^5}{R^6} \,. \tag{17}$$
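
The minimization over the variational parameter can be reproduced with a simple scan. The sketch below uses units with $e^2 = a_0 = 1$, so that $E_0 = -1$ and $\langle 0|\hat{H}'^2|0\rangle = 6/R^6$; the function name, the value of $R$ and the grid are arbitrary choices for this illustration.

```python
def energy_shift(A, E0, h2):
    """Second-order part of the expanded bound: (2A - E0 A^2) <0|H'^2|0>."""
    return (2 * A - E0 * A * A) * h2


E0 = -1.0            # ground-state energy of the two atoms in units of e^2 / a0
R = 10.0             # separation in units of a0 (arbitrary test value)
h2 = 6.0 / R ** 6    # <0|H'^2|0> in the same units

# Scan the variational parameter A on a grid from -3 to 3.
grid = [-3.0 + 0.001 * i for i in range(6001)]
A_best = min(grid, key=lambda A: energy_shift(A, E0, h2))
upper_bound_coefficient = energy_shift(1 / E0, E0, h2) * R ** 6

print(A_best, upper_bound_coefficient)
```

The scan finds the minimum at the grid point closest to $A = 1/E_0$, and the shift at the minimum corresponds to the coefficient $-6$ of the upper bound.
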
Finally, we should note that careful variational calculations have shown that the
numerical coefficient in $W(R)$ is close to 6.50.² The result achieved in this way is
not absolutely correct, because only the static dipole-dipole interaction has been
considered. If we also take into account the retardation caused by the finite
speed of propagation of the electromagnetic interaction of the two dipoles,
we find that $W(R) \sim -1/R^7$, if $R$ is large compared to the wavelength of the
electromagnetic radiation of the atomic transition:

$$R \gg \frac{\hbar c}{e^2}\,a_0 = 137\,a_0 \,.$$

But the interaction energy at these large distances is so small that it is physically
uninteresting (insignificant). Therefore we may proceed from the assumption

2 See, e.g., L. Pauling, E.B. Wilson, Jr.: Introduction to Quantum Mechanics, (McGraw 
Hill, New York 1935), Chap. 47a. 
that expression (17) is a useful and reasonable approximation for the interaction
of two spherical atoms³.


15.4 Biographical Notes 
3 See, e.g., H.B.G. Casimir, D. Polder: Phys. Rev. 73, 360 (1948).
W.’s thesis, in particular the existence of critical temperature. In 1875 he was appointed
a member of the Royal Dutch Academy of Sciences and two years later became a tenured 
professor at the University of Amsterdam. W. was a much-admired teacher and inspired 
his students to do both experimental and theoretical work. His scientific papers were 
nology (MIT) during the years 1930-1951.




16. The Formal Framework of Quantum Mechanics 


In this chapter we summarize the mathematical principles of quantum mechan- 
ics, using a more abstract mathematical formulation than before. Many of the 
relations which will be considered here have already been discussed in the 
preceding chapters in a more “physical” way and most have been proved in 
detail. Some of the explanations and proofs are supplemented or demonstrated 
once again in a more compact manner in additional exercises.


16.1 The Mathematical Foundation of Quantum Mechanics — 
Hilbert Space 


By a Hilbert space H we mean an abstract set of elements, which are called
vectors $|a\rangle$, $|b\rangle$, $|c\rangle$ etc. H has the following properties:


1. The space H is a linear vector space over the field of the complex numbers
$\mu$ and $\nu$. It has three properties:

a) to every pair of vectors $|a\rangle$, $|b\rangle$, a new vector $|c\rangle$ is related, which is called
the sum vector. It holds that

$$|a\rangle + |b\rangle = |b\rangle + |a\rangle \quad\text{(commutative law)} \,,$$

$$(|a\rangle + |b\rangle) + |c\rangle = |a\rangle + (|b\rangle + |c\rangle) \quad\text{(associative law)} \,; \tag{16.1}$$

b) a zero vector $|0\rangle$ exists, with the property

$$|a\rangle + |0\rangle = |a\rangle \,; \tag{16.2}$$

c) to each vector $|a\rangle$ of H, an antivector $|-a\rangle$ exists, fulfilling the relation

$$|a\rangle + |-a\rangle = |0\rangle \,; \tag{16.3}$$

for arbitrary complex numbers $\mu$ and $\nu$, we have

$$\mu(|a\rangle + |b\rangle) = \mu|a\rangle + \mu|b\rangle \,, \quad
(\mu + \nu)|a\rangle = \mu|a\rangle + \nu|a\rangle \,, \quad
\mu\nu|a\rangle = \mu(\nu|a\rangle) \,, \quad
1\,|a\rangle = |a\rangle \,. \tag{16.4}$$
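2. A scalar product is defined in the space H. It is denoted by

$$(|a\rangle, |b\rangle) \quad\text{or}\quad \langle a|b\rangle \,, \tag{16.5}$$

yielding a complex number. The scalar product has to fulfil the relations

$$(|a\rangle, \lambda|b\rangle) = \lambda\,(|a\rangle, |b\rangle) \,,$$

$$(|a\rangle, |b\rangle + |c\rangle) = (|a\rangle, |b\rangle) + (|a\rangle, |c\rangle) \,,$$

$$(|a\rangle, |b\rangle) = (|b\rangle, |a\rangle)^* \,. \tag{16.6}$$

The last equation may also be written as

$$\langle a|b\rangle = \langle b|a\rangle^* \,.$$

It is easily shown from this that

$$(\lambda|a\rangle, |b\rangle) = \lambda^*\,(|a\rangle, |b\rangle) = \lambda^*\langle a|b\rangle \tag{16.7a}$$

and

$$(|a_1\rangle + |a_2\rangle, |b\rangle) = (|a_1\rangle, |b\rangle) + (|a_2\rangle, |b\rangle) = \langle a_1|b\rangle + \langle a_2|b\rangle \tag{16.7b}$$

follow. The norm of the vectors is defined by

$$\|\,|a\rangle\,\| = \sqrt{\langle a|a\rangle}$$

(read: norm of vector $|a\rangle$ $= \sqrt{\langle a|a\rangle}$).


It can be shown that Schwarz's inequality,

$$\|\,|a\rangle\,\| \cdot \|\,|b\rangle\,\| \ge |\langle a|b\rangle| \,, \tag{16.8}$$

is valid and that the equality holds only for the case

$$|a\rangle = \lambda|b\rangle$$

(parallelism of the vectors).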


3. For every vector $|a\rangle$ of H, a sequence $|a_n\rangle$ of vectors exists, with the property
that for every $\varepsilon > 0$, there is at least one vector $|a_n\rangle$ of the sequence with

$\| |a\rangle - |a_n\rangle \| < \varepsilon$ .   (16.9)

A sequence with this property is called compact, or we may say that the space H is
separable.


4. The Hilbert space is complete. This means that every vector $|a\rangle$ of H can be
approximated arbitrarily closely by a sequence $|a_n\rangle$:

$\lim_{n\to\infty} \| |a\rangle - |a_n\rangle \| = 0$ .   (16.10)

Then the sequence $|a_n\rangle$ has a unique limiting value $|a\rangle$.

For Hilbert spaces with finite dimensions, axioms 3 and 4 follow from
axioms 1 and 2; then 3 and 4 are superfluous. But they are necessary for spaces of
infinite dimension, which occur in quantum mechanics in most cases. In the following,
we discuss once again some definitions that are used very often.
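The scalar-product rules (16.6), (16.7a) and Schwarz's inequality (16.8) can be checked numerically in the simplest case, a two-dimensional complex space. The following is only a minimal sketch: the helper names `inner` and `norm` and the sample vectors are illustrative assumptions, not part of the text.

```python
import math

def inner(a, b):
    """Scalar product <a|b>: antilinear in a, linear in b (cf. 16.6, 16.7a)."""
    return sum(x.conjugate() * y for x, y in zip(a, b))

def norm(a):
    """Norm of |a>, defined as sqrt(<a|a>)."""
    return math.sqrt(inner(a, a).real)

# Hypothetical sample vectors and scalar, chosen only for illustration.
a = [1 + 2j, 3 - 1j]
b = [2 - 1j, 0.5j]
lam = 0.3 - 0.7j

# <a|b> = <b|a>*
symmetry_error = abs(inner(a, b) - inner(b, a).conjugate())

# (lam|a>, |b>) = lam* <a|b>
scaled = [lam * x for x in a]
antilinear_error = abs(inner(scaled, b) - lam.conjugate() * inner(a, b))

# Schwarz: ||a|| ||b|| >= |<a|b>|, with equality for parallel vectors.
schwarz_holds = norm(a) * norm(b) >= abs(inner(a, b))
parallel = [lam * x for x in b]
gap = norm(parallel) * norm(b) - abs(inner(parallel, b))
```

For the parallel pair the two sides of (16.8) coincide up to rounding, so `gap` is numerically zero.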




1. Orthogonality of Vectors:
Two vectors $|f\rangle$ and $|g\rangle$ are orthogonal if

$\langle f|g\rangle = 0$ .   (16.11)

2. Orthonormal System:
The set $\{|f_n\rangle\}$ of vectors is an orthonormal system if

$\langle f_n|f_m\rangle = \delta_{nm}$ .   (16.12)

3. Complete Orthonormal System:
The orthonormal system $\{|f_n\rangle\}$ is complete in H if an arbitrary vector $|f\rangle$ of
H can be expressed by

$|f\rangle = \sum_n \alpha_n |f_n\rangle$ .   (16.13)

In general, $\alpha_n$ are complex numbers:

$\alpha_m = \sum_n \alpha_n \delta_{mn} = \sum_n \alpha_n \langle f_m|f_n\rangle = \langle f_m|\sum_n \alpha_n|f_n\rangle = \langle f_m|f\rangle$ ,   (16.14)

so that we can write

$|f\rangle = \sum_n |f_n\rangle\langle f_n|f\rangle$ .   (16.15)

The complex numbers $\alpha_n$ are called the $f_n$-representation of $|f\rangle$; they represent,
so to say, the vector $|f\rangle$; they are the components of $|f\rangle$ with respect to the
basis $\{|f_n\rangle\}$. If the sum in the last equation contains an infinite number of terms,
then we speak of a Hilbert space of infinite dimensions. In quantum mechanics,
this is usually the case.
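As a finite-dimensional illustration of (16.13) to (16.15), the sketch below expands a vector of a two-dimensional complex space in an orthonormal basis and rebuilds it from its components. The basis and the vector `f` are hypothetical choices made only for this example.

```python
import math

def inner(a, b):
    """Scalar product <a|b> with complex conjugation of the first argument."""
    return sum(x.conjugate() * y for x, y in zip(a, b))

s = 1 / math.sqrt(2)
# Hypothetical orthonormal basis {|f_1>, |f_2>} of C^2.
basis = [[s + 0j, s * 1j], [s + 0j, -s * 1j]]

f = [2 - 1j, 0.5 + 3j]

# Components alpha_n = <f_n|f>, cf. (16.14).
alpha = [inner(fn, f) for fn in basis]

# Rebuild |f> = sum_n |f_n><f_n|f>, cf. (16.15).
rebuilt = [sum(alpha[n] * basis[n][k] for n in range(2)) for k in range(2)]

# The norm is the same whether computed from f or from its components.
norm_sq_direct = inner(f, f).real
norm_sq_components = sum(abs(c) ** 2 for c in alpha)
```

The list `alpha` plays the role of the $f_n$-representation of $|f\rangle$.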


16.2 Operators in Hilbert Space 


A linear operator $\hat{A}$ induces a mapping of H upon itself or upon a subspace of H.
Here,

$\hat{A}(\alpha|f\rangle + \beta|g\rangle) = \alpha\hat{A}|f\rangle + \beta\hat{A}|g\rangle$ .   (16.16)

The operator $\hat{A}$ is bounded if

$\|\hat{A}|f\rangle\| \le C\,\| |f\rangle \|$   (16.17)

for all $|f\rangle$ of H, C being the same constant for all $|f\rangle$. Bounded linear operators
are continuous. This means that for

$|f_n\rangle \to |f\rangle$ ,   (16.18a)

$\hat{A}|f_n\rangle \to \hat{A}|f\rangle$   (16.18b)

also follows. Two operators $\hat{A}$ and $\hat{B}$ are equal ($\hat{A} = \hat{B}$) if, for all vectors $|f\rangle$
of H,

$\hat{A}|f\rangle = \hat{B}|f\rangle$ .   (16.19)


The following definitions are often used: 


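a) unity operator $\hat{1}$: $\hat{1}|f\rangle = |f\rangle$ ;

b) zero operator $\hat{0}$: $\hat{0}|f\rangle = |0\rangle$ ;

c) sum operator $\hat{A} + \hat{B}$: $(\hat{A} + \hat{B})|f\rangle = \hat{A}|f\rangle + \hat{B}|f\rangle$ ;

d) product operator $\hat{A}\hat{B}$: $(\hat{A}\hat{B})|f\rangle = \hat{A}(\hat{B}|f\rangle)$ .   (16.20)


The relations shown here have to be valid for all $|f\rangle$ of H. With respect to the
product operator, we have to add that, in general,

$\hat{A}\hat{B} \neq \hat{B}\hat{A}$ .

The commutator of $\hat{A}$ and $\hat{B}$ is defined by

$[\hat{A}, \hat{B}]_- = \hat{A}\hat{B} - \hat{B}\hat{A}$ .   (16.21)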


Now we explain the very important concept of the adjoint of a bounded
operator. If an operator $\hat{A}^\dagger$ exists for the operator $\hat{A}$ such that, for all $|f\rangle$ and $|g\rangle$ of H,

$(|g\rangle, \hat{A}|f\rangle) = (\hat{A}^\dagger|g\rangle, |f\rangle)$ ,   (16.22)

then $\hat{A}^\dagger$ is called the adjoint operator of $\hat{A}$. This relation can also be expressed by

$\langle g|\hat{A}|f\rangle = \langle f|\hat{A}^\dagger|g\rangle^*$ .   (16.23)

The adjoint of an operator (16.22) possesses the following properties, which are
easily derived:

1) $(\alpha\hat{A})^\dagger = \alpha^*\hat{A}^\dagger$ ;

2) $(\hat{A} + \hat{B})^\dagger = \hat{A}^\dagger + \hat{B}^\dagger$ ;

3) $(\hat{A}\hat{B})^\dagger = \hat{B}^\dagger\hat{A}^\dagger$ ;

4) $(\hat{A}^\dagger)^\dagger = \hat{A}$ .   (16.24)

All these properties were discussed and proved in Chaps. 4 and 10. On the basis
of the definition given above, the properties may immediately be confirmed.
An operator $\hat{A}$ fulfilling the relation

$\hat{A}^\dagger = \hat{A}$   (16.25)

is called a Hermitian operator. From this it follows that the expectation values
are real:

$\langle f|\hat{A}|f\rangle = \langle f|\hat{A}^\dagger|f\rangle^* = \langle f|\hat{A}|f\rangle^*$ .   (16.26)
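For matrices, the adjoint is the conjugate transpose, so the definition (16.22), rule 3 of (16.24) and the reality of Hermitian expectation values (16.26) can be tested directly. This is a sketch on 2×2 matrices; the helpers `dagger`, `matmul`, `apply`, `inner` and all numerical values are assumptions made for illustration.

```python
def dagger(M):
    """Adjoint of a square matrix: transpose and complex-conjugate."""
    n = len(M)
    return [[M[j][i].conjugate() for j in range(n)] for i in range(n)]

def matmul(A, B):
    """Matrix product of two square matrices of equal size."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def apply(M, v):
    """Apply the matrix M to the column vector v."""
    return [sum(M[i][k] * v[k] for k in range(len(v))) for i in range(len(M))]

def inner(a, b):
    """Scalar product <a|b>."""
    return sum(x.conjugate() * y for x, y in zip(a, b))

# Hypothetical operators and vectors.
A = [[1 + 1j, 2j], [3 + 0j, 4 - 2j]]
B = [[0.5j, 1 + 0j], [2 - 1j, -1j]]
f = [1 - 1j, 2 + 0.5j]
g = [0.5 + 0j, -1 + 2j]

# (16.22): (|g>, A|f>) = (A†|g>, |f>)
lhs = inner(g, apply(A, f))
rhs = inner(apply(dagger(A), g), f)

# (16.24), rule 3: (AB)† = B†A†
left = dagger(matmul(A, B))
right = matmul(dagger(B), dagger(A))

# (16.26): a Hermitian matrix has a real expectation value.
H = [[2 + 0j, 1 - 3j], [1 + 3j, -1 + 0j]]
expectation = inner(f, apply(H, f))
```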
16.3 Eigenvalues and Eigenvectors 


We speak of an eigenvector $|a\rangle$ of the operator $\hat{A}$ belonging to the eigenvalue a
in the case

$\hat{A}|a\rangle = a|a\rangle$ .   (16.27)

Here, the eigenvalue a is, in general, a complex number. For Hermitian
operators $\hat{A}$ ($\hat{A}^\dagger = \hat{A}$) in particular, the following is true:

a) The eigenvalues of Hermitian operators are real.
b) If $|a'\rangle$ and $|a''\rangle$ are two eigenvectors of a Hermitian operator $\hat{A}$ with two
different eigenvalues $a' \neq a''$, then

$\langle a'|a''\rangle = 0$ .

c) The normalized eigenvectors of a bounded Hermitian operator $\hat{A}$ form
a countable, complete orthonormal system. In this case the eigenvalues are
discrete. Then we speak of a discrete spectrum.


We therefore can conclude that an arbitrary vector $|\psi\rangle$ may be expanded in
terms of the complete orthonormal system $|a\rangle$ of the bounded Hermitian
operator $\hat{A}$:

$|\psi\rangle = \sum_a |a\rangle\langle a|\psi\rangle$ .   (16.28)

As noted above, we have

$\langle a'|a''\rangle = \delta_{a'a''}$ .   (16.29)

The scalar product of two vectors $|\varphi\rangle$ and $|\psi\rangle$ may also be expressed in the
A representation:

$\langle\varphi|\psi\rangle = \sum_a \langle\varphi|a\rangle\langle a|\psi\rangle$ .   (16.30)

Here, a helpful trick has been used. If we introduce the unity operator $\hat{1}$ by the
relation known as completeness,

$\hat{1} = \sum_a |a\rangle\langle a|$ ,   (16.31)

we get

$|\psi\rangle = \hat{1}|\psi\rangle = \sum_a |a\rangle\langle a|\psi\rangle$ ,   (16.32)

and, further,

$\langle\varphi|\psi\rangle = \langle\varphi|\hat{1}|\psi\rangle = \sum_a \langle\varphi|a\rangle\langle a|\psi\rangle$ ,   (16.33)

which is consistent with (16.28) and (16.30). The expansion (16.32) implies that

$\sum_a |\langle a|\psi\rangle|^2 = 1$ .   (16.34)

Therefore we may also say that $\langle a|\psi\rangle$ is square-summable. Apparently the
abstract Hilbert space is mapped onto the space of the square-summable functions
(eigenfunctions of the operator $\hat{A}$). We call this the A representation of $|\psi\rangle$, meaning
the infinite set of numbers $\langle a|\psi\rangle$ in (16.32). Applying an operator $\hat{B}$ to $|\psi\rangle$
yields

$\langle a'|\hat{B}|\psi\rangle = \sum_{a''} \langle a'|\hat{B}|a''\rangle\langle a''|\psi\rangle$ .   (16.35)
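Thus the operator $\hat{B}$ can be written in the A representation as the matrix

$\langle a'|\hat{B}|a''\rangle = \begin{pmatrix} \langle a_1|\hat{B}|a_1\rangle & \langle a_1|\hat{B}|a_2\rangle & \cdots \\ \langle a_2|\hat{B}|a_1\rangle & \langle a_2|\hat{B}|a_2\rangle & \cdots \\ \vdots & \vdots & \ddots \end{pmatrix}$ ,   (16.36)

and the vector $|\psi\rangle$ in A representation as

$|\psi\rangle \to \langle a'|\psi\rangle = \begin{pmatrix} \langle a_1|\psi\rangle \\ \langle a_2|\psi\rangle \\ \vdots \\ \langle a_n|\psi\rangle \\ \vdots \end{pmatrix}$ .   (16.37)

Therefore the operator $\hat{B}$ in A representation is a square matrix; the vector
$|\psi\rangle$, a column matrix. The operator $\hat{A}$ itself is given in the A representation, i.e. in its
eigenrepresentation, as

$\langle a'|\hat{A}|a''\rangle = a'\delta_{a'a''}$ .   (16.38)

Sometimes it is advantageous to write the (arbitrary) operator $\hat{B}$ in the form

$\hat{B} = \sum_{a',a''} |a'\rangle\langle a'|\hat{B}|a''\rangle\langle a''|$ .   (16.39)


The analogy of the representation of a vector in a Hilbert space to the components
of a vector in vector space is evident. The choice of the representation
coincides with the choice of the coordinate system in the Hilbert space.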

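Now we proceed to the transformation of the A representation into the B representation.
Here, the so-called transformation matrix

$\langle a|b\rangle$   (16.40)

plays an important role. In analogy to (16.38) it follows that

$\langle b'|\hat{B}|b''\rangle = b'\delta_{b'b''}$ .   (16.41)

It is convenient to start from the unity operator

$\hat{1} = \sum_{a'} |a'\rangle\langle a'| = \sum_{b'} |b'\rangle\langle b'|$ .   (16.42)

The following relations can be understood immediately:

$\langle b'|\psi\rangle = \langle b'|\hat{1}|\psi\rangle = \sum_{a'} \langle b'|a'\rangle\langle a'|\psi\rangle$ ,
$\langle a'|\psi\rangle = \langle a'|\hat{1}|\psi\rangle = \sum_{b'} \langle a'|b'\rangle\langle b'|\psi\rangle$ ,
$\langle b'|\hat{C}|b''\rangle = \langle b'|\hat{1}\hat{C}\hat{1}|b''\rangle = \sum_{a',a''} \langle b'|a'\rangle\langle a'|\hat{C}|a''\rangle\langle a''|b''\rangle$ .   (16.43)

Similarly to (16.42), we get

$\langle a'|\hat{B}\hat{C}|a''\rangle = \langle a'|\hat{B}\hat{1}\hat{C}|a''\rangle = \sum_{a'''} \langle a'|\hat{B}|a'''\rangle\langle a'''|\hat{C}|a''\rangle$ .   (16.44)


This means that for the matrix element of the product of two operators $\hat{B}\hat{C}$, the
customary rules for matrix multiplication are valid.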


EXERCISE


16.1 The Trace of an Operator 


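Problem. Show that the trace of an operator is independent of its representation.

Solution. The trace¹ of the operator $\hat{C}$ in the A representation is

$\mathrm{trace}\,\hat{C} = \sum_{a'} \langle a'|\hat{C}|a'\rangle$ .

Then we write

$\mathrm{trace}\,\hat{C} = \sum_{a'} \langle a'|\hat{C}|a'\rangle = \sum_{a'} \langle a'|\hat{1}\hat{C}\hat{1}|a'\rangle$
$= \sum_{a'}\sum_{b'}\sum_{b''} \langle a'|b'\rangle\langle b'|\hat{C}|b''\rangle\langle b''|a'\rangle$
$= \sum_{b'}\sum_{b''} \Big(\sum_{a'} \langle b''|a'\rangle\langle a'|b'\rangle\Big) \langle b'|\hat{C}|b''\rangle$
$= \sum_{b'}\sum_{b''} \delta_{b''b'} \langle b'|\hat{C}|b''\rangle = \sum_{b'} \langle b'|\hat{C}|b'\rangle$ .

Since $\hat{C}|c\rangle = c|c\rangle$, we have in the eigenrepresentation of $\hat{C}$

$\mathrm{trace}\,\hat{C} = \sum_c \langle c|\hat{C}|c\rangle = \sum_c c\langle c|c\rangle = \sum_c c$ .


¹ In the literature the German name for trace, “Spur”, is often used.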


EXERCISE


16.2 A Proof 


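Problem. Show that

$\sum_{a'}\sum_{a''} |\langle a'|\hat{C}|a''\rangle|^2 = \mathrm{trace}\,\hat{C}\hat{C}^\dagger$ .

Solution. It can easily be seen that

$\sum_{a'}\sum_{a''} |\langle a'|\hat{C}|a''\rangle|^2 = \sum_{a'}\sum_{a''} \langle a'|\hat{C}|a''\rangle\langle a'|\hat{C}|a''\rangle^*$
$= \sum_{a'}\sum_{a''} \langle a'|\hat{C}|a''\rangle\langle a''|\hat{C}^\dagger|a'\rangle$
$= \sum_{a'} \langle a'|\hat{C}\hat{C}^\dagger|a'\rangle = \mathrm{trace}\,\hat{C}\hat{C}^\dagger$ .

Here we have used (16.23) and (16.44).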
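Both exercises can be illustrated numerically on 2×2 matrices. In the sketch below the matrix `C_a` stands for the elements $\langle a'|\hat{C}|a''\rangle$ and `U` for a transformation matrix $\langle a'|b'\rangle$; both are hypothetical values chosen for this example, and the helper functions are not taken from the text.

```python
import cmath
import math

def dagger(M):
    """Adjoint of a square matrix."""
    n = len(M)
    return [[M[j][i].conjugate() for j in range(n)] for i in range(n)]

def matmul(A, B):
    """Matrix product of two square matrices of equal size."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def trace(M):
    """Sum of the diagonal elements."""
    return sum(M[i][i] for i in range(len(M)))

# Hypothetical matrix elements <a'|C|a''> in the A representation.
C_a = [[1 + 2j, 0.5 - 1j], [3j, -2 + 0.25j]]

# Hypothetical unitary transformation matrix with elements <a'|b'>.
theta, delta = 0.7, 0.4
U = [[math.cos(theta), -math.sin(theta) * cmath.exp(1j * delta)],
     [math.sin(theta) * cmath.exp(-1j * delta), math.cos(theta)]]

# Third line of (16.43) in matrix form: C_b = U† C_a U.
C_b = matmul(dagger(U), matmul(C_a, U))

# Exercise 16.1: the trace is the same in both representations.
trace_a, trace_b = trace(C_a), trace(C_b)

# Exercise 16.2: sum of |<a'|C|a''>|^2 equals trace(C C†).
sum_sq = sum(abs(x) ** 2 for row in C_a for x in row)
trace_cc = trace(matmul(C_a, dagger(C_a)))
```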


16.4 Operators with Continuous or Discrete-Continuous 
(Mixed) Spectra 


Many operators occurring in quantum mechanics do not have a discrete, but
a continuous or a mixed (discrete-continuous), spectrum. An example of an operator
with a mixed spectrum is the well-known Hamiltonian of the hydrogen
atom. Actually all Hamiltonians for atoms and nuclei have discrete and continuous
spectral ranges; therefore they have mixed spectra. Usually the discrete
eigenvalues are connected with bound states and the continuous eigenvalues are
connected with free, unbound states. The representations related to such operators
cause some difficulties because, for continuous spectra, the eigenvectors
are not normalizable to unity (cf. our discussion of Weyl’s eigendifferentials in
Chaps. 4 and 5).




1. Operators with a Continuous Spectrum
The operator $\hat{A}$ has a continuous spectrum if the eigenvalue a in

$\hat{A}|a\rangle = a|a\rangle$   (16.45)

is continuous. The states $|a\rangle$ can no longer be normalized to unity, but must be
normalized to Dirac’s delta function:

$\langle a'|a''\rangle = \delta(a' - a'')$ .   (16.46)

Here, the delta function replaces, so to speak, Kronecker’s δ of the discrete
spectrum [cf. (16.29)]. In the expansion of a state $|\psi\rangle$ in terms of a complete set
$|a\rangle$, the sums [cf. (16.28)] are replaced by integrals:

$|\psi\rangle = \int |a'\rangle\langle a'|\psi\rangle\,da'$ .   (16.47)

$\langle a'|\psi\rangle$ represents the wave function in the A representation. The inner product
of two vectors $|\varphi\rangle$ and $|\psi\rangle$ changes analogously to (16.30) into

$\langle\varphi|\psi\rangle = \int \langle\varphi|a'\rangle\langle a'|\psi\rangle\,da'$ ,   (16.48a)

which is sometimes written as

$\langle\varphi|\psi\rangle = \int \varphi^*(a')\psi(a')\,da'$ .   (16.48b)

Here, $\psi(a) = \langle a|\psi\rangle$ may be understood (somewhat imprecisely) as a “wave
function in A space”. Of course, it is just the A representation of $|\psi\rangle$.


2. Operators with a Mixed Spectrum
If the equation

$\hat{A}|a\rangle = a|a\rangle$

yields discrete as well as continuous eigenvalues a, we are dealing with a mixed
spectrum (cf. Fig. 16.1).
In these cases, the expansion of $|\psi\rangle$ in terms of $|a\rangle$ reads

$|\psi\rangle = \sum_{a'} |a'\rangle\langle a'|\psi\rangle + \int |a'\rangle\langle a'|\psi\rangle\,da'$ ,   (16.49)

where the sum extends over the discrete, and the integral over the continuous,
eigenstates $|a\rangle$.

In order to make the notation more compact, it is understood that $\sum_{a'}$ or
$\int \ldots da'$ is split into the discrete and the continuous parts of the spectrum, if
there are any, according to (16.49).


16.5 Operator Functions 


Operator functions $f(\hat{A})$ may be defined as a power series if the function f(x)
can be expanded in this way. Thus if

$f(x) = \sum_{n=0}^{\infty} C_n x^n$ ,

then the operator function $f(\hat{A})$ is defined by

$f(\hat{A}) = \sum_{n=0}^{\infty} C_n \hat{A}^n$ .   (16.50)


Fig. 16.1. A mixed spectrum: discrete for a below a threshold value,
continuous above it


For example, $e^{\hat{A}}$, $\cos\hat{A}$, etc. may be defined in this way. Another possibility of
defining operator functions is obtained via their eigenvalues: if

$\hat{A}|a'\rangle = a'|a'\rangle$ ,

then we have

$f(\hat{A})|a'\rangle = f(a')|a'\rangle$ .   (16.51)

For operator functions of the form (16.50), (16.51) follows immediately. Two
exercises will illustrate these points.


EXERCISE


16.3 Operator Functions 
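Problem. Derive the relation

$\langle b'|f(\hat{A})|b''\rangle = \sum_{a'} \langle b'|a'\rangle f(a')\langle a'|b''\rangle$ .

Solution. We calculate:

$\langle b'|f(\hat{A})|b''\rangle = \langle b'|\hat{1}f(\hat{A})\hat{1}|b''\rangle$
$= \sum_{a',a''} \langle b'|a'\rangle\langle a'|f(\hat{A})|a''\rangle\langle a''|b''\rangle$
$= \sum_{a',a''} \langle b'|a'\rangle f(a')\delta_{a'a''}\langle a''|b''\rangle$
$= \sum_{a'} \langle b'|a'\rangle f(a')\langle a'|b''\rangle$ .   (1)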


EXERCISE


16.4 Power-Series and Eigenvalue Methods 


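Problem. Show by the method of power-series expansion (16.50) and by the
method of eigenvalues (16.51) that

$e^{i(\beta/2)\hat\sigma_x} = \begin{pmatrix} \cos\frac{\beta}{2} & i\sin\frac{\beta}{2} \\ i\sin\frac{\beta}{2} & \cos\frac{\beta}{2} \end{pmatrix}$ .

Solution. a) We use the power series of the exponential function and get

$e^{i(\beta/2)\hat\sigma_x} = \sum_{n=0}^{\infty} \frac{1}{n!}\left(\frac{i\beta}{2}\right)^n \hat\sigma_x^n$ .   (1)

We have $\hat\sigma_x^2 = 1 = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}$ and therefore $\hat\sigma_x^3 = \hat\sigma_x$. For this reason, the series (1)
splits into even and odd powers. We get

$e^{i(\beta/2)\hat\sigma_x} = 1\sum_{n\,\mathrm{even}} \frac{1}{n!}\left(\frac{i\beta}{2}\right)^n + \hat\sigma_x\sum_{n\,\mathrm{odd}} \frac{1}{n!}\left(\frac{i\beta}{2}\right)^n = 1\cos\frac{\beta}{2} + i\hat\sigma_x\sin\frac{\beta}{2}$ .   (2)

b) We use the method of eigenvalues (16.51). It is suitable to introduce the
vectors

$|z, +1\rangle = \begin{pmatrix} 1 \\ 0 \end{pmatrix}$   and   $|z, -1\rangle = \begin{pmatrix} 0 \\ 1 \end{pmatrix}$ ,   (3)

i.e. the eigenstates of $\hat\sigma_z = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}$. This property is expressed by the notation
$|z, i\rangle$. Now, it can easily be checked that

$\langle z, i|\hat\sigma_x|z, j\rangle = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}$ .   (4)

To use the method of eigenvalues, we need the eigenvalues of $\hat\sigma_x$. For this
purpose, we solve the eigenvalue problem

$\hat\sigma_x|x, \lambda\rangle = \lambda|x, \lambda\rangle$ ,   (5)

and find $\lambda = \pm 1$ and the normalized eigenvectors,

$|x, +1\rangle = \frac{1}{\sqrt{2}}\begin{pmatrix} 1 \\ 1 \end{pmatrix}$ ,   $|x, -1\rangle = \frac{1}{\sqrt{2}}\begin{pmatrix} 1 \\ -1 \end{pmatrix}$ .   (6)

Using (1) of Exercise 16.3, we get

$\langle z, i|e^{i(\beta/2)\hat\sigma_x}|z, j\rangle = \sum_{\lambda=\pm 1} \langle z, i|x, \lambda\rangle e^{i(\beta/2)\lambda}\langle x, \lambda|z, j\rangle$ .   (7)

From this we are able to construct all matrix elements. For example, for i = j = 1,
we get

$\langle z, 1|e^{i(\beta/2)\hat\sigma_x}|z, 1\rangle = (1\;\,0)\frac{1}{\sqrt{2}}\begin{pmatrix} 1 \\ 1 \end{pmatrix} e^{i\beta/2}\,\frac{1}{\sqrt{2}}(1\;\,1)\begin{pmatrix} 1 \\ 0 \end{pmatrix} + (1\;\,0)\frac{1}{\sqrt{2}}\begin{pmatrix} 1 \\ -1 \end{pmatrix} e^{-i\beta/2}\,\frac{1}{\sqrt{2}}(1\;\,{-1})\begin{pmatrix} 1 \\ 0 \end{pmatrix}$
$= \frac{1}{2}e^{i\beta/2} + \frac{1}{2}e^{-i\beta/2} = \cos\frac{\beta}{2}$ .

In a similar manner we derive the other matrix elements, and finally arrive at

$\langle z, i|e^{i(\beta/2)\hat\sigma_x}|z, j\rangle = \begin{pmatrix} \cos\frac{\beta}{2} & i\sin\frac{\beta}{2} \\ i\sin\frac{\beta}{2} & \cos\frac{\beta}{2} \end{pmatrix}$ .   (8)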
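The two routes of this exercise can also be compared numerically. The sketch below sums a truncated power series (16.50), evaluates the eigenvalue sum (16.51), and compares both with the closed form. The truncation at 40 terms and the sample angle `beta` are assumptions made for illustration.

```python
import cmath
import math

def matmul(A, B):
    """Product of two 2x2 matrices."""
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def exp_series(M, terms=40):
    """Truncated power series sum_n M^n / n!, cf. (16.50)."""
    result = [[1 + 0j, 0j], [0j, 1 + 0j]]
    term = [[1 + 0j, 0j], [0j, 1 + 0j]]
    for n in range(1, terms):
        term = [[x / n for x in row] for row in matmul(term, M)]
        result = [[result[i][j] + term[i][j] for j in range(2)] for i in range(2)]
    return result

def exp_eigen(beta):
    """Eigenvalue method, cf. (16.51): sum over the eigenvalues +1, -1 of sigma_x."""
    s = 1 / math.sqrt(2)
    eigenvectors = {+1: [s, s], -1: [s, -s]}  # real components in the z basis
    out = [[0j, 0j], [0j, 0j]]
    for lam, v in eigenvectors.items():
        phase = cmath.exp(1j * beta / 2 * lam)
        for i in range(2):
            for j in range(2):
                out[i][j] += v[i] * phase * v[j]
    return out

beta = 1.3  # hypothetical sample angle
M = [[0j, 1j * beta / 2], [1j * beta / 2, 0j]]  # i (beta/2) sigma_x
series = exp_series(M)
eigen = exp_eigen(beta)
closed = [[math.cos(beta / 2), 1j * math.sin(beta / 2)],
          [1j * math.sin(beta / 2), math.cos(beta / 2)]]
```

Both `series` and `eigen` agree with `closed` to rounding accuracy for moderate values of `beta`.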


Even the inverse operator $\hat{A}^{-1}$ can be defined by the method of eigenvalues
(and not only by the inversion of the matrix), namely:

$\hat{A}^{-1}|a'\rangle = \frac{1}{a'}|a'\rangle$ .   (16.52)

With $\hat{A}|a'\rangle = a'|a'\rangle$, we have

$\hat{A}^{-1}\hat{A} = \hat{A}\hat{A}^{-1} = 1$ .

If one of the eigenvalues of $\hat{A}$, i.e. one of the quantities a', vanishes, the inverse
operator cannot be defined. In this case, $\hat{A}^{-1}$ does not exist.


16.6 Unitary Transformations 


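An operator $\hat{U}$ is unitary if

$\hat{U}^\dagger = \hat{U}^{-1}$ .   (16.53)

A unitary transformation is given by a unitary operator:

$|\psi_{\rm new}\rangle = \hat{U}|\psi_{\rm old}\rangle$ .   (16.54)

Hence, for an operator, it follows that

$\langle\psi_{\rm new}|\hat{A}_{\rm new}|\varphi_{\rm new}\rangle = \langle\hat{U}\psi_{\rm old}|\hat{A}_{\rm new}|\hat{U}\varphi_{\rm old}\rangle = \langle\psi_{\rm old}|\hat{U}^\dagger\hat{A}_{\rm new}\hat{U}|\varphi_{\rm old}\rangle \overset{\rm def}{=} \langle\psi_{\rm old}|\hat{A}_{\rm old}|\varphi_{\rm old}\rangle$ .

Therefore

$\hat{A}_{\rm old} = \hat{U}^\dagger\hat{A}_{\rm new}\hat{U}$ ,   or
$\hat{A}_{\rm new} = \hat{U}\hat{A}_{\rm old}\hat{U}^\dagger = \hat{U}\hat{A}_{\rm old}\hat{U}^{-1}$ ,   (16.55)

where we have used (16.53). It can easily be checked that scalar products are
invariant under unitary transformations, because

$\langle\psi_{\rm new}|\varphi_{\rm new}\rangle = \langle\hat{U}\psi_{\rm old}|\hat{U}\varphi_{\rm old}\rangle = \langle\psi_{\rm old}|\hat{U}^\dagger\hat{U}|\varphi_{\rm old}\rangle = \langle\psi_{\rm old}|\varphi_{\rm old}\rangle$ .   (16.56)

Also the eigenvalues of $\hat{A}_{\rm new}$ are the same as those of $\hat{A}_{\rm old}$ (invariance of the
eigenvalues), i.e.

$\hat{A}_{\rm new}|a_{\rm new}\rangle = \hat{U}\hat{A}_{\rm old}\hat{U}^\dagger\hat{U}|a_{\rm old}\rangle = \hat{U}\hat{A}_{\rm old}|a_{\rm old}\rangle = \hat{U}a_{\rm old}|a_{\rm old}\rangle$
$= a_{\rm old}\hat{U}|a_{\rm old}\rangle = a_{\rm old}|a_{\rm new}\rangle$ .   (16.57)

It can easily be shown that, given

$\hat{C}_{\rm old} = \hat{A}_{\rm old}\hat{B}_{\rm old}$   and   (16.58)
$\hat{D}_{\rm old} = \hat{A}_{\rm old} + \hat{B}_{\rm old}$ ,   (16.59)

it also holds that

$\hat{C}_{\rm new} = \hat{A}_{\rm new}\hat{B}_{\rm new}$   and   (16.58a)
$\hat{D}_{\rm new} = \hat{A}_{\rm new} + \hat{B}_{\rm new}$ .   (16.59a)


The generalization of these relations is obvious: All algebraic operations remain
unchanged by unitary transformations.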
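The invariance statements (16.56), (16.57) and (16.58a) can be tried out on 2×2 matrices. The following sketch uses a hypothetical unitary matrix `U` and hypothetical operators and states; none of the numerical values come from the text.

```python
import cmath
import math

def dagger(M):
    """Adjoint of a square matrix."""
    n = len(M)
    return [[M[j][i].conjugate() for j in range(n)] for i in range(n)]

def matmul(A, B):
    """Matrix product of two square matrices of equal size."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def apply(M, v):
    """Apply the matrix M to the column vector v."""
    return [sum(M[i][k] * v[k] for k in range(len(v))) for i in range(len(M))]

def inner(a, b):
    """Scalar product <a|b>."""
    return sum(x.conjugate() * y for x, y in zip(a, b))

theta, delta = 0.7, 0.4
U = [[math.cos(theta), -math.sin(theta) * cmath.exp(1j * delta)],
     [math.sin(theta) * cmath.exp(-1j * delta), math.cos(theta)]]

def transform(M):
    """Operator transformation A_new = U A_old U†, cf. (16.55)."""
    return matmul(U, matmul(M, dagger(U)))

A_old = [[1 + 0j, 0j], [0j, -2 + 0j]]            # eigenvalues 1 and -2
B_old = [[0.5 + 0j, 1 - 1j], [1 + 1j, 2 + 0j]]
A_new, B_new = transform(A_old), transform(B_old)

psi_old, phi_old = [1 + 1j, 2 - 0.5j], [0.3j, 1 + 0j]
psi_new, phi_new = apply(U, psi_old), apply(U, phi_old)

# (16.56): scalar products are invariant.
overlap_old = inner(psi_old, phi_old)
overlap_new = inner(psi_new, phi_new)

# (16.57): U|a_old> is an eigenvector of A_new with the same eigenvalue 1.
a_old = [1 + 0j, 0j]
a_new = apply(U, a_old)
residual = [x - 1 * y for x, y in zip(apply(A_new, a_new), a_new)]

# (16.58a): the product relation is preserved.
C_new_direct = transform(matmul(A_old, B_old))
C_new_product = matmul(A_new, B_new)
```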


16.7 The Direct-Product Space 


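Frequently the Hilbert space must be expanded, because new degrees of freedom
are discovered. One example we have already encountered is the spin of the
electron (see Chap. 12). The total wave function consists of the product of the
spatial wave function $\psi(x, y, z)$ and the spin wave function $\chi(\sigma)$:

$\psi(x, y, z)\,\chi(\sigma)$ .

We say the Hilbert space is extended by direct-product formation. The following
examples explain this further.

A nucleon may be either a neutron or a proton with nearly identical masses:
$m_p c^2$ = 938.256 MeV, $m_n c^2$ = 939.550 MeV. For this reason we consider it as
a particle with two states, the proton state $|p\rangle$ and the neutron state $|n\rangle$:

$|p\rangle = \begin{pmatrix} 1 \\ 0 \end{pmatrix}$ ,   $|n\rangle = \begin{pmatrix} 0 \\ 1 \end{pmatrix}$ .   (16.60)

The vectors $|p\rangle$ and $|n\rangle$ span the two-dimensional charge space or isospin space
(in analogy to the spin). Since the nucleon may also occupy two different spin
states

$|\!\uparrow\rangle = \begin{pmatrix} 1 \\ 0 \end{pmatrix}$   and   $|\!\downarrow\rangle = \begin{pmatrix} 0 \\ 1 \end{pmatrix}$ ,   (16.61)

the direct product space consisting of spin and isospin space is given by the
four-dimensional space with the basis vectors

$|p\!\uparrow\rangle = \begin{pmatrix} 1 \\ 0 \end{pmatrix}_{\rm charge} \times \begin{pmatrix} 1 \\ 0 \end{pmatrix}_{\rm spin} = \begin{pmatrix} 1 \\ 0 \\ 0 \\ 0 \end{pmatrix}$ ,
$|p\!\downarrow\rangle = \begin{pmatrix} 1 \\ 0 \end{pmatrix}_{\rm charge} \times \begin{pmatrix} 0 \\ 1 \end{pmatrix}_{\rm spin} = \begin{pmatrix} 0 \\ 1 \\ 0 \\ 0 \end{pmatrix}$ ,
$|n\!\uparrow\rangle = \begin{pmatrix} 0 \\ 1 \end{pmatrix}_{\rm charge} \times \begin{pmatrix} 1 \\ 0 \end{pmatrix}_{\rm spin} = \begin{pmatrix} 0 \\ 0 \\ 1 \\ 0 \end{pmatrix}$ ,
$|n\!\downarrow\rangle = \begin{pmatrix} 0 \\ 1 \end{pmatrix}_{\rm charge} \times \begin{pmatrix} 0 \\ 1 \end{pmatrix}_{\rm spin} = \begin{pmatrix} 0 \\ 0 \\ 0 \\ 1 \end{pmatrix}$ .   (16.62)


Thus, in this four-dimensional space, the charge properties as well as the spin
properties of the nucleon can be described. If further “intrinsic” properties
(i.e. more inner degrees of freedom) of the nucleon should be discovered, the
space will have to be further enlarged. In fact, a situation similar to the one just
discussed arises if we consider particles and antiparticles.²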
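The construction of the four basis vectors in (16.62) is the direct (Kronecker) product of a charge vector with a spin vector. A minimal sketch, in which the function name `kron` and the dictionary keys are illustrative choices:

```python
def kron(u, v):
    """Direct product of two column vectors: components u_i * v_j, with i as the slow index."""
    return [x * y for x in u for y in v]

# Charge (isospin) space, cf. (16.60).
proton, neutron = [1, 0], [0, 1]
# Spin space, cf. (16.61).
up, down = [1, 0], [0, 1]

basis = {
    "p_up": kron(proton, up),
    "p_down": kron(proton, down),
    "n_up": kron(neutron, up),
    "n_down": kron(neutron, down),
}
```

Each product vector has a single non-zero component, so the four vectors form the standard basis of the four-dimensional product space.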


16.8 The Axioms of Quantum Mechanics 


It is not easy to summarize the axioms or rules of quantum mechanics. Here we
will follow E.G. Harris³ and refer to the extensive discussions of von Neumann⁴
and Jauch⁵.


² We will encounter this situation in Vol. 3 of this series, Relativistic Quantum Mechanics,
where the Dirac spinor also turns out to have four components: two for the spin and two
for the particle-antiparticle degrees of freedom.

³ E.G. Harris: A Pedestrian Approach to Quantum Field Theory (Wiley, New York 1972).
⁴ J. von Neumann: The Mathematical Foundations of Quantum Mechanics (Princeton,
NJ 1955).

⁵ J.M. Jauch: Foundations of Quantum Mechanics (Addison-Wesley, Reading, Mass.
1968).


Quantum mechanics is based upon the following correspondence between 
physical and mathematical quantities: 


1. The state of a physical system is characterized by a vector (more precisely:
by a ray) in Hilbert space. Hence, $|\psi\rangle$ and $\lambda|\psi\rangle$ describe the same
state. In general, the state vectors are normalized to unity, to enable the
probability interpretation.

2. The dynamic observable physical quantities (observables) are described 
by operators in the Hilbert space H. These operators of observables are 
Hermitian operators. Their eigenvectors form a basis of H; any vector of H 
may be expanded in terms of this basis. 


These general principles are supplemented by the following fundamental phys- 
ical axioms: 


Axiom 1: As a result of the measurement of an observable, only one of the 
eigenvalues of the corresponding operator can be found. After the measurement, 
the system occupies that state which corresponds to the measured eigenvalue. 


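Axiom 2: If the system occupies the state $|a'\rangle$, the probability of finding the
value b' in a measurement of B reads

$W(a', b') = |\langle a'|b'\rangle|^2$ .   (16.63)

If $\hat{B}$ has a continuous spectrum,

$dW(a', b') = |\langle a'|b'\rangle|^2\,db'$   (16.63a)

is the probability that B has a value within the interval between b' and b' + db'.

Axiom 3: The operators $\hat{A}$ and $\hat{B}$, which correspond to the classical quantities
A and B, fulfil the commutation relation

$[\hat{A}, \hat{B}]_- = \hat{A}\hat{B} - \hat{B}\hat{A} = i\hbar\{A, B\}_{\rm op}$ ,   (16.64)

where $\{A, B\}_{\rm op}$ is the operator which corresponds to the classical Poisson
bracket,

$\{A, B\} = \sum_i \left( \frac{\partial A}{\partial q_i}\frac{\partial B}{\partial p_i} - \frac{\partial A}{\partial p_i}\frac{\partial B}{\partial q_i} \right)$ .   (16.65)

$q_i$ and $p_i$ are the classical coordinates and momenta of the system. It follows that

$[\hat{q}_i, \hat{q}_j]_- = [\hat{p}_i, \hat{p}_j]_- = 0$ ,   $[\hat{q}_i, \hat{p}_j]_- = i\hbar\delta_{ij}$ ,   (16.66)

and, similarly, for the orbital angular momentum
$\mathbf{L} = \mathbf{r}\times\mathbf{p} = (y p_z - z p_y,\; z p_x - x p_z,\; x p_y - y p_x)$,

$[\hat{L}_x, \hat{L}_y]_- = i\hbar\sum_i \left( \frac{\partial L_x}{\partial q_i}\frac{\partial L_y}{\partial p_i} - \frac{\partial L_x}{\partial p_i}\frac{\partial L_y}{\partial q_i} \right)_{\rm op}$
$= i\hbar[(-p_y)(-x) - (y)(p_x)]_{\rm op}$
$= i\hbar(\hat{x}\hat{p}_y - \hat{y}\hat{p}_x) = i\hbar\hat{L}_z$ .   (16.67)

For the other angular-momentum commutation relations we get a similar result
and may write

$\hat{\mathbf{L}}\times\hat{\mathbf{L}} = i\hbar\hat{\mathbf{L}}$ .   (16.68)


We should pay attention to the following consequence of this axiom. If we define
the expectation value of an observable $\hat{A}$ by

$\langle\hat{A}\rangle = \langle\psi|\hat{A}|\psi\rangle$   (16.69)

and the uncertainty (the mean variation) by

$\Delta A = \sqrt{\langle(\hat{A} - \langle\hat{A}\rangle)^2\rangle} = \sqrt{\langle\psi|(\hat{A} - \langle\psi|\hat{A}|\psi\rangle)^2|\psi\rangle}$ ,   (16.70)

it follows that (see Chap. 4, Heisenberg’s uncertainty relation and arbitrary
observables)

$\Delta A\,\Delta B \ge \frac{1}{2}\left|\langle[\hat{A}, \hat{B}]_-\rangle\right|$ .   (16.71)

This is the general formulation of Heisenberg’s uncertainty relation. In particular,
for the variables $p_i$ and $q_i$, using (16.66) we have

$\Delta p_i\,\Delta q_i \ge \frac{\hbar}{2}$ .   (16.72)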
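The general relation (16.71) can be illustrated in a finite-dimensional setting. The sketch below uses the spin-1/2 operators $\hat{S}_x$ and $\hat{S}_y$ in units with ħ = 1 instead of position and momentum; this choice of operators, the sample state and the helper names are assumptions made only for the example.

```python
import cmath
import math

def matmul(A, B):
    """Product of two 2x2 matrices."""
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def apply(M, v):
    """Apply the matrix M to the column vector v."""
    return [sum(M[i][k] * v[k] for k in range(2)) for i in range(2)]

def inner(a, b):
    """Scalar product <a|b>."""
    return sum(x.conjugate() * y for x, y in zip(a, b))

def expect(M, psi):
    """Expectation value <psi|M|psi>, cf. (16.69)."""
    return inner(psi, apply(M, psi))

def uncertainty(M, psi):
    """Uncertainty sqrt(<(M - <M>)^2>), cf. (16.70)."""
    mean = expect(M, psi).real
    shifted = [[M[i][j] - (mean if i == j else 0) for j in range(2)]
               for i in range(2)]
    return math.sqrt(expect(matmul(shifted, shifted), psi).real)

# Spin-1/2 operators with hbar = 1 (illustrative stand-ins for A and B).
Sx = [[0, 0.5], [0.5, 0]]
Sy = [[0, -0.5j], [0.5j, 0]]

# Hypothetical normalized state.
theta, delta = 1.1, 0.6
psi = [math.cos(theta / 2), cmath.exp(1j * delta) * math.sin(theta / 2)]

SxSy, SySx = matmul(Sx, Sy), matmul(Sy, Sx)
commutator = [[SxSy[i][j] - SySx[i][j] for j in range(2)] for i in range(2)]

lhs = uncertainty(Sx, psi) * uncertainty(Sy, psi)
rhs = 0.5 * abs(expect(commutator, psi))
```

For this state the product of uncertainties `lhs` exceeds the bound `rhs`, as (16.71) requires.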


Hitherto, we have dealt with states (vectors) and observables at one instant of
time. The dynamics of a system may be described in different, equivalent ways.
The most customary one is the Schrödinger picture, in which the state vector is
time dependent, but the operators of the observables are independent of time.


Axiom 4: If a system is described by the state $|\psi_{t_0}\rangle$ at time $t_0$ and by $|\psi_t\rangle$ at time
t, both states are connected by the unitary transformation

$|\psi_t\rangle = \hat{U}(t - t_0)|\psi_{t_0}\rangle$ ,   (16.73)

where

$\hat{U}(t - t_0) = \exp\left[-\frac{i}{\hbar}\hat{H}(t - t_0)\right]$ ,   (16.74)

and $\hat{H}$ is the Hamiltonian of the system.
From (16.73) and (16.74), Schrödinger’s equation follows. Let

$dt = t - t_0$ ,   $d|\psi\rangle = |\psi_{t_0+dt}\rangle - |\psi_{t_0}\rangle$   and   $\hat{U}(dt) = 1 - \frac{i}{\hbar}\hat{H}\,dt$ ;

then

$i\hbar\frac{d}{dt}|\psi\rangle = \hat{H}|\psi\rangle$ .   (16.75)

Notice that Schrödinger’s equation is generally valid. In particular, it is valid
for time-independent, as well as for time-dependent, Hamiltonians $\hat{H}$. Only
in the first case ($\hat{H}$ time independent) may we conclude (16.73) from (16.75)
(cf. Chap. 11). Therefore the special form of the time development (16.73) is only
valid for time-independent Hamiltonians.⁶

The Heisenberg picture is another description of the dynamics of a physical
system, which is equivalent to the Schrödinger picture, as mentioned above. We
obtain it from (16.73), by applying the unitary transformation

$|\psi_t\rangle_H = \hat{U}^\dagger|\psi_t\rangle_S = \hat{U}^\dagger\hat{U}|\psi_{t_0}\rangle_S = |\psi_{t_0}\rangle_S$   (16.76)

to the state vectors. Then the operators transform according to (16.55), and we get

$\hat{A}_H = \hat{U}^\dagger\hat{A}_S\hat{U}$ .   (16.77)

The subscripts H and S stand for “Heisenberg” and “Schrödinger”, respectively.
In Heisenberg’s representation, the state $|\psi_t\rangle_H = |\psi_{t_0}\rangle_S$ is apparently a fixed
time-independent state. Compared to this, the operators

$\hat{A}_H(t) = \exp\left[\frac{i}{\hbar}\hat{H}(t - t_0)\right]\hat{A}_S\exp\left[-\frac{i}{\hbar}\hat{H}(t - t_0)\right]$   (16.78)

are time dependent because of (16.77) and (16.74). By differentiation of (16.78),
we find that $\hat{A}_H(t)$ fulfils the equation

$i\hbar\frac{d\hat{A}_H}{dt} = \hat{A}_H\hat{H} - \hat{H}\hat{A}_H = [\hat{A}_H, \hat{H}]_-$ .   (16.79)

It is called Heisenberg’s equation of motion for the operator $\hat{A}_H$ in the Heisenberg
picture and has to be considered in analogy to the classical equation of motion
of a dynamic variable A in the form of Poisson brackets,

$\frac{dA}{dt} = \{A, H\}$ .   (16.80)

Heisenberg’s equation leads immediately to the important result that an operator
which commutes with the Hamiltonian is a constant of motion.
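The differentiation step leading from (16.78) to (16.79) can be written out as follows. This is a sketch under two assumptions that the derivation needs: the Hamiltonian is time independent (so that it commutes with the time-evolution operator), and the Schrödinger operator carries no explicit time dependence.

```latex
\begin{aligned}
\hat{U} &= \exp\!\left[-\frac{i}{\hbar}\hat{H}(t-t_0)\right], \qquad
\frac{d\hat{U}}{dt} = -\frac{i}{\hbar}\hat{H}\hat{U}, \qquad
\frac{d\hat{U}^\dagger}{dt} = \frac{i}{\hbar}\hat{H}\hat{U}^\dagger , \\[4pt]
\frac{d\hat{A}_H}{dt}
 &= \frac{d\hat{U}^\dagger}{dt}\,\hat{A}_S\hat{U}
  + \hat{U}^\dagger\hat{A}_S\,\frac{d\hat{U}}{dt}
  = \frac{i}{\hbar}\left(\hat{H}\hat{U}^\dagger\hat{A}_S\hat{U}
  - \hat{U}^\dagger\hat{A}_S\hat{U}\hat{H}\right)
  = \frac{i}{\hbar}\left(\hat{H}\hat{A}_H - \hat{A}_H\hat{H}\right), \\[4pt]
i\hbar\,\frac{d\hat{A}_H}{dt}
 &= \hat{A}_H\hat{H} - \hat{H}\hat{A}_H = [\hat{A}_H, \hat{H}]_- .
\end{aligned}
```

If the commutator vanishes, the right-hand side is zero and the operator does not change in time, which is the constant-of-motion statement above.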


16.9 Free Particles 


It will be useful to study the motion of a free particle more carefully and to
systematically summarize the various mathematical operations and tricks once
again. First we consider the free motion of a particle in one dimension and later
we will devote ourselves to the three-dimensional problem. The dynamic variables
are now the coordinate x and the momentum p, and the Hamiltonian is
$\hat{H} = \hat{p}^2/2m$. The eigenvalue equations for $\hat{x}$ and $\hat{p}$ read

$\hat{x}|x'\rangle = x'|x'\rangle$ ,   (16.81a)

$\hat{p}|p'\rangle = p'|p'\rangle$ .   (16.81b)


⁶ See W. Greiner, B. Müller: Theoretical Physics, Vol. 2, 2nd corr. ed., Quantum
Mechanics II - Symmetries (Springer, Berlin, Heidelberg, New York 1994), especially the
section on isotropy in time.


By definition, a truly free particle may occupy any position x' and also have any
momentum p'. Therefore in (16.81) we have to deal with continuous spectra, so
that the eigenstates $|x'\rangle$ and $|p'\rangle$ must be normalized to δ functions:

$\langle x'|x''\rangle = \delta(x' - x'')$ ,   (16.82a)

$\langle p'|p''\rangle = \delta(p' - p'')$ .   (16.82b)

Using the commutation relation

$[\hat{x}, \hat{p}]_- = i\hbar$ ,   (16.83)

we can calculate the matrix elements of $\hat{p}$ in x representation:

$\langle x'|\hat{x}\hat{p} - \hat{p}\hat{x}|x''\rangle = \langle x'|\hat{x}\hat{1}\hat{p} - \hat{p}\hat{1}\hat{x}|x''\rangle$
$= \int dx''' \left[ \langle x'|\hat{x}|x'''\rangle\langle x'''|\hat{p}|x''\rangle - \langle x'|\hat{p}|x'''\rangle\langle x'''|\hat{x}|x''\rangle \right]$
$= \int dx''' \left[ x'\delta(x' - x''')\langle x'''|\hat{p}|x''\rangle - \langle x'|\hat{p}|x'''\rangle x''\delta(x''' - x'') \right]$
$= x'\langle x'|\hat{p}|x''\rangle - x''\langle x'|\hat{p}|x''\rangle = (x' - x'')\langle x'|\hat{p}|x''\rangle$   (16.84)

and, on the other hand, because of (16.83),

$\langle x'|\hat{x}\hat{p} - \hat{p}\hat{x}|x''\rangle = i\hbar\delta(x' - x'')$ ,   (16.85)

so that

$(x' - x'')\langle x'|\hat{p}|x''\rangle = i\hbar\delta(x' - x'')$ .   (16.86)

With the aid of the identity

$x\frac{d}{dx}\delta(x) = -\delta(x)$   (16.87)

we get

$i\hbar\delta(x' - x'') = -i\hbar(x' - x'')\frac{\partial\delta(x' - x'')}{\partial(x' - x'')} = -i\hbar(x' - x'')\frac{\partial\delta(x' - x'')}{\partial x'}$ .   (16.88)

Finally, by using (16.86), we obtain

$\langle x'|\hat{p}|x''\rangle = -i\hbar\frac{\partial}{\partial x'}\delta(x' - x'')$ .   (16.89)

In the following exercise we will recalculate the analogous relation

$\langle p'|\hat{x}|p''\rangle = i\hbar\frac{\partial}{\partial p'}\delta(p' - p'')$ ,   (16.90)

which is what we expect, because of the antisymmetric position of $\hat{x}$ and $\hat{p}$
in (16.84).
EXERCISE


16.5 Position Operator in Momentum Space 
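Problem. Prove the relation

$\langle p'|\hat{x}|p''\rangle = i\hbar\frac{\partial}{\partial p'}\delta(p' - p'')$   (1)

in a similar manner to relation (16.89).

Solution.

$\langle p'|\hat{x}\hat{p} - \hat{p}\hat{x}|p''\rangle = \langle p'|\hat{x}\hat{1}\hat{p} - \hat{p}\hat{1}\hat{x}|p''\rangle$
$= \int dp''' \left[ \langle p'|\hat{x}|p'''\rangle\langle p'''|\hat{p}|p''\rangle - \langle p'|\hat{p}|p'''\rangle\langle p'''|\hat{x}|p''\rangle \right]$
$= \int dp''' \left[ \langle p'|\hat{x}|p'''\rangle p''\delta(p''' - p'') - p'\delta(p' - p''')\langle p'''|\hat{x}|p''\rangle \right]$
$= -(p' - p'')\langle p'|\hat{x}|p''\rangle$   (2)

and, on the other hand, because of (16.83), this is equal to

$i\hbar\delta(p' - p'')$ .   (3)

So we get

$-(p' - p'')\langle p'|\hat{x}|p''\rangle = i\hbar\delta(p' - p'')$
[according to (16.87)]
$= -i\hbar(p' - p'')\frac{\partial}{\partial(p' - p'')}\delta(p' - p'')$
$= -i\hbar(p' - p'')\frac{\partial}{\partial p'}\delta(p' - p'')$ .   (4)

It follows that

$\langle p'|\hat{x}|p''\rangle = i\hbar\frac{\partial}{\partial p'}\delta(p' - p'')$ .   (5)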


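The matrix elements $\langle x'|\hat{p}^2|x''\rangle$ can also be calculated directly by computing
the matrix product. Briefly:

$\langle x'|\hat{p}^2|x''\rangle = \langle x'|\hat{p}\hat{1}\hat{p}|x''\rangle = \int dx''' \langle x'|\hat{p}|x'''\rangle\langle x'''|\hat{p}|x''\rangle$
$= \int dx''' \left[ -i\hbar\frac{\partial}{\partial x'}\delta(x' - x''') \right]\left[ -i\hbar\frac{\partial}{\partial x'''}\delta(x''' - x'') \right]$
$= -i\hbar\frac{\partial}{\partial x'}\int dx''' \delta(x' - x''')\left( -i\hbar\frac{\partial}{\partial x'''}\delta(x''' - x'') \right)$
$= \left( -i\hbar\frac{\partial}{\partial x'} \right)^2\delta(x' - x'')$ .   (16.91)

Similarly, we get the more general relations,

$\langle x'|\hat{p}^n|x''\rangle = \left( -i\hbar\frac{\partial}{\partial x'} \right)^n\delta(x' - x'')$   and   (16.92)

$\langle p'|\hat{x}^n|p''\rangle = \left( i\hbar\frac{\partial}{\partial p'} \right)^n\delta(p' - p'')$ .   (16.93)


Now we consider the eigenvalue problem for the momentum in coordinate
representation:

$\hat{p}|p'\rangle = p'|p'\rangle$ .   (16.94)

We have

$\langle x'|\hat{p}|p'\rangle = \int dx'' \langle x'|\hat{p}|x''\rangle\langle x''|p'\rangle = \int dx'' \left( -i\hbar\frac{\partial}{\partial x'}\delta(x' - x'') \right)\langle x''|p'\rangle$
$= -i\hbar\frac{\partial}{\partial x'}\int dx'' \delta(x' - x'')\langle x''|p'\rangle$
$= -i\hbar\frac{\partial}{\partial x'}\langle x'|p'\rangle$ .   (16.95)

On the other hand, it follows from (16.94) that

$\langle x'|\hat{p}|p'\rangle = p'\langle x'|p'\rangle$ ,

so that the differential equation for $\langle x'|p'\rangle$,

$-i\hbar\frac{\partial}{\partial x'}\langle x'|p'\rangle = p'\langle x'|p'\rangle$ ,   (16.96)

results. Its solution is

$\langle x'|p'\rangle = \psi_{p'}(x') = \frac{1}{\sqrt{2\pi\hbar}}\exp\left( \frac{i}{\hbar}p'x' \right)$ .   (16.97)

Here, we have chosen the normalization in such a way that

$\langle p''|p'\rangle = \int dx' \langle p''|x'\rangle\langle x'|p'\rangle = \int dx'\,\psi^*_{p''}(x')\psi_{p'}(x') = \delta(p'' - p')$ .   (16.98)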
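That the prefactor in (16.97) yields the δ normalization (16.98) can be checked in two lines. The sketch uses the standard Fourier representation of the delta function and its scaling property; both are well-known identities quoted here, not results derived in this section.

```latex
\begin{aligned}
\int_{-\infty}^{\infty} dx'\,\psi^*_{p''}(x')\,\psi_{p'}(x')
 &= \frac{1}{2\pi\hbar}\int_{-\infty}^{\infty} dx'\,
    \exp\!\left[\frac{i}{\hbar}(p'-p'')x'\right] \\
 &= \frac{1}{2\pi\hbar}\,2\pi\,\delta\!\left(\frac{p'-p''}{\hbar}\right)
  = \delta(p'-p'') ,
\end{aligned}
\qquad\text{using}\quad
\int_{-\infty}^{\infty} e^{ikx}\,dx = 2\pi\,\delta(k), \quad
\delta\!\left(\frac{k}{\hbar}\right) = \hbar\,\delta(k) .
```

Since the delta function is even, this is the same as $\delta(p'' - p')$ in (16.98).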


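Now we generalize the above results to three dimensions. According to (16.66),
the three space coordinates commute with each other. Hence, they may be combined
into the state

$|\mathbf{x}\rangle = |x, y, z\rangle$ .   (16.99)

By definition $|\mathbf{x}\rangle$ is also an eigenstate of the operators $\hat{x}$, $\hat{y}$ and $\hat{z}$:

$\hat{x}|\mathbf{x}'\rangle = x'|\mathbf{x}'\rangle$ ,   $\hat{y}|\mathbf{x}'\rangle = y'|\mathbf{x}'\rangle$ ,   $\hat{z}|\mathbf{x}'\rangle = z'|\mathbf{x}'\rangle$ ,

or, in short,

$\hat{\mathbf{x}}|\mathbf{x}'\rangle = \mathbf{x}'|\mathbf{x}'\rangle$ .   (16.100)

As the spectrum is continuous, we may (must) normalize to δ functions:

$\langle\mathbf{x}'|\mathbf{x}''\rangle = \delta(\mathbf{x}' - \mathbf{x}'') = \delta(x' - x'')\delta(y' - y'')\delta(z' - z'')$ .   (16.101)

The operators $\hat{p}_x$, $\hat{p}_y$, $\hat{p}_z$ commute with each other, too, so that we may form the
common eigenvector $|\mathbf{p}\rangle$ with

$\hat{\mathbf{p}}|\mathbf{p}'\rangle = \mathbf{p}'|\mathbf{p}'\rangle$ .   (16.102)

Again, we have normalization to δ functions:

$\langle\mathbf{p}''|\mathbf{p}'\rangle = \delta(\mathbf{p}' - \mathbf{p}'') = \delta(p'_x - p''_x)\delta(p'_y - p''_y)\delta(p'_z - p''_z)$ .   (16.103)

Now we want to return to (16.89). Every single step which led to this solution
may be repeated for each component $\hat{p}_x$, $\hat{p}_y$, $\hat{p}_z$ with the state vector $|\mathbf{x}\rangle$. Thus
we get

$\langle\mathbf{x}'|\hat{p}_x|\mathbf{x}''\rangle = -i\hbar\frac{\partial}{\partial x'}\delta(\mathbf{x}' - \mathbf{x}'')$ ,   etc.   (16.104)

We may combine this in the form

$\langle\mathbf{x}'|\hat{\mathbf{p}}|\mathbf{x}''\rangle = -i\hbar\frac{\partial}{\partial\mathbf{x}'}\delta(\mathbf{x}' - \mathbf{x}'')$
$= -i\hbar\left( \mathbf{e}_1\frac{\partial}{\partial x'} + \mathbf{e}_2\frac{\partial}{\partial y'} + \mathbf{e}_3\frac{\partial}{\partial z'} \right)\delta(\mathbf{x}' - \mathbf{x}'')$ .   (16.105)

Similarly, we conclude immediately that

$\langle\mathbf{p}'|\hat{\mathbf{x}}|\mathbf{p}''\rangle = i\hbar\frac{\partial}{\partial\mathbf{p}'}\delta(\mathbf{p}' - \mathbf{p}'')$
$= i\hbar\left( \mathbf{e}_1\frac{\partial}{\partial p'_x} + \mathbf{e}_2\frac{\partial}{\partial p'_y} + \mathbf{e}_3\frac{\partial}{\partial p'_z} \right)\delta(\mathbf{p}' - \mathbf{p}'')$ ,   (16.106)

which is analogous to (16.90). The differential equation (16.96) can also be
generalized to three dimensions without any difficulties:

$-i\hbar\frac{\partial}{\partial\mathbf{x}'}\langle\mathbf{x}'|\mathbf{p}'\rangle = \mathbf{p}'\langle\mathbf{x}'|\mathbf{p}'\rangle$   (16.107)

with the solution

$\langle\mathbf{x}'|\mathbf{p}'\rangle = \psi_{\mathbf{p}'}(\mathbf{x}') = \frac{1}{(2\pi\hbar)^{3/2}}\exp\left( \frac{i}{\hbar}\mathbf{p}'\cdot\mathbf{x}' \right)$ ,   (16.108)

normalized to δ functions. Using the results (16.91) and (16.92), respectively, we
get the Hamiltonian of a free particle $\hat{H} = \hat{\mathbf{p}}^2/2m$ in x representation:

$\langle\mathbf{x}'|\hat{H}|\mathbf{x}''\rangle = \langle\mathbf{x}'|\frac{\hat{\mathbf{p}}^2}{2m}|\mathbf{x}''\rangle = -\frac{\hbar^2}{2m}\nabla'^2\delta(\mathbf{x}' - \mathbf{x}'')$ .   (16.109)

In p representation, this reads

$\langle\mathbf{p}'|\hat{H}|\mathbf{p}''\rangle = \langle\mathbf{p}'|\frac{\hat{\mathbf{p}}^2}{2m}|\mathbf{p}''\rangle = \frac{\mathbf{p}'^2}{2m}\delta(\mathbf{p}' - \mathbf{p}'')$ .   (16.110)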


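Now we turn to the time-dependent description. In particular, we are interested
in the propagation of the wave which describes a free particle; this is called free
propagation. For this, we use (16.73) and (16.74), and express $\psi(\mathbf{x}', t) = \langle\mathbf{x}'|\psi_t\rangle$
by $\psi(\mathbf{x}', t_0) = \langle\mathbf{x}'|\psi_{t_0}\rangle$ as

$|\psi_t\rangle = \exp[-i\hat{H}(t - t_0)/\hbar]|\psi_{t_0}\rangle$ ,
$\psi(\mathbf{x}', t) = \langle\mathbf{x}'|\psi_t\rangle = \langle\mathbf{x}'|\exp[-i\hat{H}(t - t_0)/\hbar]|\psi_{t_0}\rangle$
$= \int d^3x''\,\langle\mathbf{x}'|\exp[-i\hat{H}(t - t_0)/\hbar]|\mathbf{x}''\rangle\langle\mathbf{x}''|\psi_{t_0}\rangle$
$= \int d^3x''\,G(\mathbf{x}', t|\mathbf{x}'', t_0)\,\psi(\mathbf{x}'', t_0)$ .   (16.111)

Here,

$G(\mathbf{x}', t|\mathbf{x}'', t_0) = \langle\mathbf{x}'|\exp[-i\hat{H}(t - t_0)/\hbar]|\mathbf{x}''\rangle$   (16.112)

is called Green’s function or the propagator. It describes the time development of
the wave $\psi(\mathbf{x}', t)$, starting with the initial wave $\psi(\mathbf{x}'', t_0)$. Its explicit calculation
can be accomplished immediately in the case of free particles with $\hat{H} = \hat{\mathbf{p}}^2/2m$:

$G(\mathbf{x}', t|\mathbf{x}'', t_0) = \int\!\!\int d^3p'\,d^3p''\,\langle\mathbf{x}'|\mathbf{p}'\rangle\langle\mathbf{p}'|\exp\left[ -\frac{i}{\hbar}\frac{\hat{\mathbf{p}}^2}{2m}(t - t_0) \right]|\mathbf{p}''\rangle\langle\mathbf{p}''|\mathbf{x}''\rangle$
$= \int\!\!\int d^3p'\,d^3p''\,\langle\mathbf{x}'|\mathbf{p}'\rangle\exp\left[ -\frac{i}{\hbar}\frac{\mathbf{p}''^2}{2m}(t - t_0) \right]\delta(\mathbf{p}' - \mathbf{p}'')\langle\mathbf{p}''|\mathbf{x}''\rangle$
$= \int d^3p'\,\langle\mathbf{x}'|\mathbf{p}'\rangle\langle\mathbf{p}'|\mathbf{x}''\rangle\exp\left[ -\frac{i}{\hbar}\frac{\mathbf{p}'^2}{2m}(t - t_0) \right]$
$= \int\frac{d^3p'}{(2\pi\hbar)^3}\exp\left\{ \frac{i}{\hbar}\left[ \mathbf{p}'\cdot(\mathbf{x}' - \mathbf{x}'') - \frac{\mathbf{p}'^2}{2m}(t - t_0) \right] \right\}$ .   (16.113)

The integral can be computed analytically (cf. Exercise 16.6), giving

$G(\mathbf{x}', t|\mathbf{x}'', t_0) = \left[ \frac{m}{2\pi i\hbar(t - t_0)} \right]^{3/2}\exp\left[ \frac{im}{2\hbar}\frac{(\mathbf{x}' - \mathbf{x}'')^2}{t - t_0} \right]$ .   (16.114)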


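Finally, we want to make some comments on the description of free particles
with spin. This is simply done by constructing the direct product of a vector $|\mathbf{x}'\rangle$,
$|\mathbf{p}'\rangle$ or $|\psi\rangle$ and a spin vector $|\sigma\rangle$. For particles with spin 1/2, the vector $|\sigma\rangle$ is, for
example, given by:

$|z, \uparrow\rangle = \begin{pmatrix} 1 \\ 0 \end{pmatrix}$ ,   $|z, \downarrow\rangle = \begin{pmatrix} 0 \\ 1 \end{pmatrix}$ .   (16.115)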


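The argument z of these spin vectors indicates that we have chosen the representation
with $\hat\sigma_z = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}$ being diagonal. Hence, we have

$|\psi, \sigma\rangle = |\psi\rangle|\sigma\rangle$   and   (16.116)

$\langle\mathbf{x}|\psi, \sigma\rangle = \psi(\mathbf{x})|\sigma\rangle = \begin{pmatrix} \psi_1(\mathbf{x}) \\ \psi_2(\mathbf{x}) \end{pmatrix}$ .   (16.117)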


Thus a spin-1/2 particle is represented by a wave function with two components 
(a spinor). 


EXERCISE


16.6 Calculating the Propagator Integral 


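Problem. Calculate the propagator integral (16.113)

$G(\mathbf{x}' t|\mathbf{x}'' t_0) = \int_{-\infty}^{\infty}\frac{d^3p'}{(2\pi\hbar)^3}\exp\left\{ \frac{i}{\hbar}\left[ \mathbf{p}'\cdot(\mathbf{x}' - \mathbf{x}'') - \frac{\mathbf{p}'^2}{2m}(t - t_0) \right] \right\}$ .

Solution. By suitably rearranging the terms in the exponent, we convert the
propagator integral to

$G(\mathbf{x}' t|\mathbf{x}'' t_0) = \frac{1}{(2\pi\hbar)^3}\int d^3p'\,\exp\left\{ -\frac{i}{\hbar}\frac{(t - t_0)}{2m}\left[ \mathbf{p}'^2 - \frac{2m\,\mathbf{p}'\cdot(\mathbf{x}' - \mathbf{x}'')}{t - t_0} \right] \right\}$ .   (1)

By adding and subtracting the quadratic complement in the exponent, we find

$G(\mathbf{x}' t|\mathbf{x}'' t_0) = \frac{1}{(2\pi\hbar)^3}\int d^3p'\,\exp\left\{ -\frac{i}{\hbar}\frac{(t - t_0)}{2m}\left[ \left( \mathbf{p}' - \frac{m(\mathbf{x}' - \mathbf{x}'')}{t - t_0} \right)^2 - \left( \frac{m(\mathbf{x}' - \mathbf{x}'')}{t - t_0} \right)^2 \right] \right\}$ .   (2)

Now we may put a factor in front of the integral:

$G(\mathbf{x}' t|\mathbf{x}'' t_0) = \frac{1}{(2\pi\hbar)^3}\exp\left[ \frac{im}{2\hbar}\frac{(\mathbf{x}' - \mathbf{x}'')^2}{t - t_0} \right]\int d^3p'\,\exp\left[ -\frac{i}{\hbar}\frac{(t - t_0)}{2m}\left( \mathbf{p}' - \frac{m(\mathbf{x}' - \mathbf{x}'')}{t - t_0} \right)^2 \right]$ .   (3)

We substitute the new variables

$\mathbf{P}' = \mathbf{p}' - \frac{m(\mathbf{x}' - \mathbf{x}'')}{t - t_0}$ ,   $a = \frac{i}{\hbar}\frac{(t - t_0)}{2m}$ .   (4)

The integration over $d^3p'$ is transformed into an integration over
$d^3P' = 4\pi P'^2\,dP'$. Here, attention has to be paid to the fact that the lower limit
of the integral becomes zero:

$G(\mathbf{x}' t|\mathbf{x}'' t_0) = \frac{4\pi}{(2\pi\hbar)^3}\exp\left[ \frac{im}{2\hbar}\frac{(\mathbf{x}' - \mathbf{x}'')^2}{t - t_0} \right]\int_0^{\infty} P'^2\exp(-aP'^2)\,dP'$ .   (5)

Because

$\int_0^{\infty} P'^2\exp(-aP'^2)\,dP' = \frac{\sqrt{\pi}}{4}a^{-3/2}$ ,   (6)

we get immediately

$G(\mathbf{x}' t|\mathbf{x}'' t_0) = \left[ \frac{m}{2\pi i\hbar(t - t_0)} \right]^{3/2}\exp\left[ \frac{im}{2\hbar}\frac{(\mathbf{x}' - \mathbf{x}'')^2}{t - t_0} \right]$ .   (7)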
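The radial Gaussian integral (6) can be checked by simple numerical quadrature. In the exercise the parameter a is purely imaginary, and the integral is then understood as a limit; the sketch below therefore assumes a value of a with a positive real part, for which the integrand decays and a midpoint rule converges. The cutoff `upper`, the step count and the sample value of `a` are illustrative assumptions.

```python
import cmath
import math

def radial_integral(a, upper=12.0, steps=20000):
    """Midpoint-rule estimate of the integral of P^2 exp(-a P^2) from 0 to `upper`."""
    h = upper / steps
    total = 0j
    for k in range(steps):
        P = (k + 0.5) * h
        total += P * P * cmath.exp(-a * P * P)
    return total * h

# Hypothetical complex parameter with Re(a) > 0 so that the integrand decays.
a = 1.0 + 2.0j

numeric = radial_integral(a)
closed = math.sqrt(math.pi) / 4 * a ** (-1.5)   # right-hand side of (6)
```

The complex power uses the principal branch, which is the appropriate one for a in the right half-plane.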


EXAMPLE


16.7 The One-Dimensional Oscillator in Various Representations 


The harmonic oscillator plays an important role in many areas of physics; above 
all, in field theory (e.g. quantization of the electromagnetic field). Thus it is useful 
to summarize its properties. The Hamiltonian of the one-dimensional oscillator 
is given by 


a (1) 


and the corresponding energy eigenvalue problem reads 
H|EY=E|ED .- (2) 


We solve the problem (2) for different representations. 


a) x Representation (Coordinate Representation) 
Here, H may be written as (cf. (16.91) and (16.92) ] 
<x'|H|x") = A(x’, p')6(x! — x” 


A ho 
= H 5 i eens ) i = xe" 


a hoe h? 6? ma? 
EL 3 Po ene aes ce 2 SRE Manse x? . 
(s “a | 2m dx’? i 2 y (4) 
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Now (2) reads 


CH= a Ik ES 


Bi ee 
= (dx"H (x. | 5(x’ — x")<x"|EY 


A h o 
=8(x.2 
MW é 


Oo 
ax’ 


)<x'1e> = «xB» (5) 
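In the last step, we have written down the right-hand side of (2). Using
$\psi_E(x') = \langle x'|E\rangle$, we may write

$$\hat{H}\left(x', \frac{\hbar}{i}\frac{\partial}{\partial x'}\right)\psi_E(x') = E\,\psi_E(x') . \tag{6}$$

This differential equation, familiar from Chap. 7, has normalizable solutions
only if $E$ takes one of the eigenvalues

$$E_n = \hbar\omega\left(n + \tfrac{1}{2}\right) , \qquad n = 0, 1, 2, \ldots \tag{7}$$

The solutions belonging to these eigenvalues are

$$\psi_{E_n}(x') = \left(\frac{m\omega}{\pi\hbar}\right)^{1/4}\frac{1}{\sqrt{2^n n!}}\,H_n(\xi)\,\exp\left(-\frac{\xi^2}{2}\right) , \tag{8}$$

with $\xi = \sqrt{m\omega/\hbar}\,x'$. $H_n(\xi)$ are the well-known Hermite polynomials.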
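
The statement that the functions (8) are normalized and belong to the eigenvalues (7) can be checked numerically. The following is a minimal sketch, not part of the original text: it assumes units with $\hbar = m = \omega = 1$ (so that $\xi = x'$), builds the Hermite polynomials from the standard recurrence, and evaluates $\langle\psi_n|\hat{H}|\psi_n\rangle$ with a finite-difference second derivative. The function names `hermite`, `psi` and `energy` and the grid parameters `L`, `N` are illustrative choices.

```python
import math


def hermite(n, xi):
    """Physicists' Hermite polynomial H_n(xi) via H_{k+1} = 2 xi H_k - 2 k H_{k-1}."""
    h_prev, h = 1.0, 2.0 * xi
    if n == 0:
        return h_prev
    for k in range(1, n):
        h_prev, h = h, 2.0 * xi * h - 2.0 * k * h_prev
    return h


def psi(n, x):
    """Oscillator eigenfunction (8) in units hbar = m = omega = 1 (xi = x)."""
    norm = math.pi ** -0.25 / math.sqrt(2.0 ** n * math.factorial(n))
    return norm * hermite(n, x) * math.exp(-0.5 * x * x)


def energy(n, L=10.0, N=4000):
    """Return (<H>, norm) for psi_n with H = -1/2 d^2/dx^2 + x^2/2 on [-L, L]."""
    dx = 2.0 * L / N
    num = den = 0.0
    for j in range(1, N):
        x = -L + j * dx
        f = psi(n, x)
        # central finite difference for the second derivative
        d2 = (psi(n, x + dx) - 2.0 * f + psi(n, x - dx)) / dx ** 2
        num += f * (-0.5 * d2 + 0.5 * x * x * f) * dx
        den += f * f * dx
    return num / den, den


if __name__ == "__main__":
    for n in range(4):
        e, norm = energy(n)
        print(n, round(e, 4), round(norm, 6))
```

In these units the printed energies should lie close to $n + \tfrac12$ and the norms close to 1, up to the discretization error of the grid.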


b) p Representation (Momentum Representation) 


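Because of (16.81b) and (16.93), the Hamiltonian (1) in p representation is
given by

$$\langle p'|\hat{H}|p''\rangle = \hat{H}(\hat{x}', \hat{p}')\,\delta(p'-p'') = \hat{H}\left(-\frac{\hbar}{i}\frac{\partial}{\partial p'}, p'\right)\delta(p'-p'') , \tag{9}$$

where

$$\hat{H}\left(-\frac{\hbar}{i}\frac{\partial}{\partial p'}, p'\right) = \frac{p'^2}{2m} - \frac{m\omega^2}{2}\hbar^2\frac{\partial^2}{\partial p'^2} . \tag{10}$$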


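Then, with $\langle p'|E\rangle = \psi_E(p')$, (2) reads

$$\begin{aligned}
\langle p'|\hat{H}|E\rangle &= \int dp''\,\langle p'|\hat{H}|p''\rangle\langle p''|E\rangle \\
&= \int dp''\,\hat{H}\left(-\frac{\hbar}{i}\frac{\partial}{\partial p'}, p'\right)\delta(p'-p'')\,\langle p''|E\rangle \\
&= E\,\psi_E(p') .
\end{aligned} \tag{11}$$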
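In the last step we added the right-hand side of (2). The resulting eigenvalue
equation in momentum space,

$$\left(\frac{p'^2}{2m} - \frac{m\omega^2\hbar^2}{2}\frac{\partial^2}{\partial p'^2}\right)\psi_E(p') = E\,\psi_E(p') , \tag{12}$$

is easily transformed into

$$\left(-\frac{\hbar^2}{2m}\frac{\partial^2}{\partial p'^2} + \frac{1}{2m\,m^2\omega^2}\,p'^2\right)\psi_E(p') = \frac{E}{m^2\omega^2}\,\psi_E(p') , \tag{13}$$

$$\left(-\frac{\hbar^2}{2m}\frac{\partial^2}{\partial p'^2} + \frac{m\tilde{\omega}^2}{2}\,p'^2\right)\psi_E(p') = \tilde{E}\,\psi_E(p') . \tag{14}$$

Here, we have set $\tilde{\omega}^2 = 1/m^4\omega^2$ and $\tilde{E} = E/m^2\omega^2$.
Equation (14) has the same form as (4) and (6), respectively, with $x'$
replaced by $p'$. Thus the eigenvalues are

$$\tilde{E}_n = \hbar\tilde{\omega}\left(n + \tfrac{1}{2}\right) , \qquad n = 0, 1, \ldots$$

or

$$E_n = \hbar\,\frac{m^2\omega^2}{m^2\omega}\left(n + \tfrac{1}{2}\right) = \hbar\omega\left(n + \tfrac{1}{2}\right) . \tag{15}$$

Thus they are identical to the values we got in (7). In the same easy way, (8) is
transformed into the wave function $\psi_{E_n}(p')$ in momentum space:

$$\psi_{E_n}(p') = \left(\frac{m\tilde{\omega}}{\pi\hbar}\right)^{1/4}\frac{1}{\sqrt{2^n n!}}\,H_n(\eta)\,\exp\left(-\frac{\eta^2}{2}\right)
= \left(\frac{1}{\pi m\omega\hbar}\right)^{1/4}\frac{1}{\sqrt{2^n n!}}\,H_n(\eta)\,\exp\left(-\frac{\eta^2}{2}\right) , \qquad \eta = \frac{p'}{\sqrt{m\omega\hbar}} . \tag{16}$$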


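According to Axiom 2 [see (16.63) and (16.63a)], the probability of finding
a particle in the energy state $\psi_{E_n}(x')$ in the interval between $x'$ and
$x' + dx'$ is given by

$$dW(x') = |\langle x'|E_n\rangle|^2\,dx' = |\psi_{E_n}(x')|^2\,dx' . \tag{17}$$

In the same manner, the probability of finding the particle with a momentum
between $p'$ and $p' + dp'$ can be calculated:

$$dW(p') = |\langle p'|E_n\rangle|^2\,dp' = |\psi_{E_n}(p')|^2\,dp' . \tag{18}$$

The wave functions in coordinate space $\psi_{E_n}(x')$ and in momentum space
$\psi_{E_n}(p')$ are connected by

$$\langle x'|E_n\rangle = \psi_{E_n}(x') = \int dp'\,\langle x'|p'\rangle\langle p'|E_n\rangle
= \int dp'\,\frac{1}{\sqrt{2\pi\hbar}}\exp\left(\frac{i}{\hbar}p'x'\right)\psi_{E_n}(p') \tag{19a}$$

and

$$\psi_{E_n}(p') = \int dx'\,\frac{1}{\sqrt{2\pi\hbar}}\exp\left(-\frac{i}{\hbar}p'x'\right)\psi_{E_n}(x') , \tag{19b}$$

respectively.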
c) Algebraic Method (Algebraic Representation)


The algebraic method does not explicitly use any representation to solve the
eigenvalue problem (2); it is particularly useful in field theory. We introduce the
operators


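$$\hat{a} = \sqrt{\frac{m\omega}{2\hbar}}\,\hat{x} + \frac{i}{\sqrt{2m\hbar\omega}}\,\hat{p} \tag{20a}$$

and

$$\hat{a}^+ = \sqrt{\frac{m\omega}{2\hbar}}\,\hat{x} - \frac{i}{\sqrt{2m\hbar\omega}}\,\hat{p} . \tag{20b}$$

They are called the annihilation operator ($\hat{a}$) and the creation operator
($\hat{a}^+$), respectively, of oscillator phonons. It can be seen that
$(\hat{a})^+ = \hat{a}^+$; i.e. $\hat{a} = (\hat{a}^+)^+$.
Their commutation relations are easily calculated:

$$\begin{aligned}
[\hat{a}, \hat{a}^+]_- &= \left[\sqrt{\frac{m\omega}{2\hbar}}\,\hat{x} + \frac{i}{\sqrt{2m\hbar\omega}}\,\hat{p}\,,\ \sqrt{\frac{m\omega}{2\hbar}}\,\hat{x} - \frac{i}{\sqrt{2m\hbar\omega}}\,\hat{p}\right]_- \\
&= -\frac{i}{2\hbar}[\hat{x}, \hat{p}]_- + \frac{i}{2\hbar}[\hat{p}, \hat{x}]_- = 1
\end{aligned} \tag{21}$$

and

$$[\hat{a}, \hat{a}]_- = [\hat{a}^+, \hat{a}^+]_- = 0 . \tag{22}$$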


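In a similar manner, we find

$$\hat{H} = \hbar\omega\left(\hat{a}^+\hat{a} + \tfrac{1}{2}\right) = \hbar\omega\left(\hat{N} + \tfrac{1}{2}\right) , \quad\text{where} \tag{23}$$

$$\hat{N} = \hat{a}^+\hat{a} . \tag{24}$$

We denote the eigenvectors of $\hat{N}$ by $|n\rangle$, so we have

$$\hat{N}|n\rangle = n|n\rangle . \tag{25}$$

We study the vector $|g\rangle$, which results from applying the annihilation
operator $\hat{a}$ to $|n\rangle$:

$$|g\rangle = \hat{a}|n\rangle . \tag{26}$$

Applying $\hat{N}$ to $|g\rangle$ and using the commutation relation (21), we find that

$$\begin{aligned}
\hat{N}|g\rangle &= \hat{a}^+\hat{a}\,\hat{a}|n\rangle = (\hat{a}\hat{a}^+ - 1)\hat{a}|n\rangle \\
&= \hat{a}(\hat{a}^+\hat{a})|n\rangle - \hat{a}|n\rangle = \hat{a}\,n|n\rangle - \hat{a}|n\rangle \\
&= (n-1)\hat{a}|n\rangle = (n-1)|g\rangle .
\end{aligned} \tag{27}$$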
Thus $|g\rangle$ is an eigenvector of $\hat{N}$, too, but it belongs to the eigenvalue $(n-1)$.
Therefore, assuming that the eigenvalue $(n-1)$ is not degenerate, $|g\rangle$ and
$|n-1\rangle$ are identical except for a constant. So we can write

$$|g\rangle = C_n|n-1\rangle . \tag{28}$$

The constant $C_n$ can be determined by calculating the norm:

$$(\hat{a}|n\rangle, \hat{a}|n\rangle) = (C_n|n-1\rangle, C_n|n-1\rangle)
\;\Leftrightarrow\; \langle n|\hat{a}^+\hat{a}|n\rangle = |C_n|^2\langle n-1|n-1\rangle
\;\Leftrightarrow\; n = |C_n|^2 .$$

Setting $C_n = \sqrt{n}$, a phase factor equal to unity is chosen, and (28) reads:

$$\hat{a}|n\rangle = \sqrt{n}\,|n-1\rangle . \tag{29}$$


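Similarly, we may conclude that

$$\hat{a}^+|n\rangle = \sqrt{n+1}\,|n+1\rangle . \tag{30}$$

Equations (29) and (30) explain the nomenclature of annihilation and creation
operator. Equation (25) suggests interpreting $\hat{N}$ as a number operator for
oscillator quanta. Now $n \geq 0$ always; we see this by multiplying (25) by $\langle n|$:

$$\langle n|\hat{N}|n\rangle = n\langle n|n\rangle = \langle n|\hat{a}^+\hat{a}|n\rangle = \langle\hat{a}n|\hat{a}n\rangle ;$$

hence,

$$n = \frac{\langle\hat{a}n|\hat{a}n\rangle}{\langle n|n\rangle} = \frac{\|\hat{a}|n\rangle\|^2}{\||n\rangle\|^2} \geq 0 . \tag{31}$$

Starting with the vector $|n\rangle$, the states

$$|n-1\rangle ,\ |n-2\rangle ,\ |n-3\rangle ,\ \ldots \tag{32}$$

may be generated by successive applications of the annihilation operator $\hat{a}$.
Because of (31), only non-negative eigenvalues are allowed. Thus the series (32)
has to terminate, namely with the state $|0\rangle$. For this state we have

$$\hat{a}|0\rangle = 0 , \qquad \hat{N}|0\rangle = 0 . \tag{33}$$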


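$|0\rangle$ is called a vacuum state (also: ground state) for oscillator quanta. On the
basis of (23) to (25), we conclude that the states $|n\rangle$ are eigenstates of $\hat{H}$ with
the eigenvalues $E_n = \hbar\omega(n + \tfrac{1}{2})$:

$$\hat{H}|n\rangle = \hbar\omega\left(\hat{N} + \tfrac{1}{2}\right)|n\rangle = \hbar\omega\left(n + \tfrac{1}{2}\right)|n\rangle . \tag{34}$$

Frequently the matrix elements of $\hat{x}$ and $\hat{p}$ have to be determined. They are
easily specified by solving (20) for $\hat{x}$ and $\hat{p}$, respectively:

$$\hat{x} = \sqrt{\frac{\hbar}{2m\omega}}\,(\hat{a} + \hat{a}^+) , \tag{35a}$$

$$\hat{p} = i\sqrt{\frac{m\hbar\omega}{2}}\,(\hat{a}^+ - \hat{a}) , \tag{35b}$$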
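and, by using (29) and (30):

$$\begin{aligned}
\langle n_1|\hat{x}|n_2\rangle &= \sqrt{\frac{\hbar}{2m\omega}}\left[\sqrt{n_2+1}\,\delta_{n_1,n_2+1} + \sqrt{n_2}\,\delta_{n_1,n_2-1}\right] , \\
\langle n_1|\hat{p}|n_2\rangle &= i\sqrt{\frac{m\hbar\omega}{2}}\left[\sqrt{n_2+1}\,\delta_{n_1,n_2+1} - \sqrt{n_2}\,\delta_{n_1,n_2-1}\right] .
\end{aligned} \tag{36}$$

In a similar manner we get

$$\begin{aligned}
\langle n_1|\hat{x}^2|n_2\rangle &= \sum_n \langle n_1|\hat{x}|n\rangle\langle n|\hat{x}|n_2\rangle \\
&= \frac{\hbar}{2m\omega}\sum_n\left[\sqrt{n+1}\,\delta_{n_1,n+1} + \sqrt{n}\,\delta_{n_1,n-1}\right]
\times\left[\sqrt{n_2+1}\,\delta_{n,n_2+1} + \sqrt{n_2}\,\delta_{n,n_2-1}\right] \\
&= \frac{\hbar}{2m\omega}\left[(2n_1+1)\,\delta_{n_1 n_2} + \sqrt{n_1(n_1-1)}\,\delta_{n_1,n_2+2} + \sqrt{(n_1+1)(n_1+2)}\,\delta_{n_1,n_2-2}\right]
\end{aligned} \tag{37}$$

and

$$\langle n_1|\hat{p}^2|n_2\rangle = \frac{m\hbar\omega}{2}\left[(2n_1+1)\,\delta_{n_1 n_2} - \sqrt{n_1(n_1-1)}\,\delta_{n_1,n_2+2} - \sqrt{(n_1+1)(n_1+2)}\,\delta_{n_1,n_2-2}\right] . \tag{38}$$

From these two equations we obtain

$$\langle n_1|\hat{H}|n_2\rangle = \frac{1}{2m}\langle n_1|\hat{p}^2|n_2\rangle + \frac{m\omega^2}{2}\langle n_1|\hat{x}^2|n_2\rangle = \hbar\omega\left(n_1 + \tfrac{1}{2}\right)\delta_{n_1 n_2} . \tag{39}$$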
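
The algebraic results can be made concrete with finite matrices. The sketch below is an illustration under stated assumptions, not the book's procedure: it works in units $\hbar = m = \omega = 1$, truncates the basis to $|0\rangle, \ldots, |N-1\rangle$, builds $\hat{a}$ and $\hat{a}^+$ from (29) and (30), and forms $\hat{H} = \hat{p}^2/2 + \hat{x}^2/2$ from (35). The helper names `matmul`, `ladder` and `hamiltonian` are hypothetical. Because of the truncation, the last diagonal element is not reliable and is excluded from the comparison with (39).

```python
import math


def matmul(A, B):
    """Plain-Python product of two square matrices given as nested lists."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def ladder(N):
    """Truncated N x N matrices of a and a^+ in the basis |0>, ..., |N-1>."""
    # <n-1| a |n> = sqrt(n), see (29); a^+ is the transpose, see (30)
    a = [[math.sqrt(j) if i == j - 1 else 0.0 for j in range(N)] for i in range(N)]
    a_dag = [[a[j][i] for j in range(N)] for i in range(N)]
    return a, a_dag


def hamiltonian(N):
    """H = p^2/2 + x^2/2 with x, p taken from (35), units hbar = m = omega = 1."""
    a, a_dag = ladder(N)
    s = [[a[i][j] + a_dag[i][j] for j in range(N)] for i in range(N)]  # sqrt(2) * x
    d = [[a_dag[i][j] - a[i][j] for j in range(N)] for i in range(N)]  # p = (i/sqrt(2)) * d
    s2, d2 = matmul(s, s), matmul(d, d)
    # x^2 = s^2 / 2 and p^2 = -d^2 / 2, hence H = (s^2 - d^2) / 4
    return [[0.25 * (s2[i][j] - d2[i][j]) for j in range(N)] for i in range(N)]


if __name__ == "__main__":
    H = hamiltonian(6)
    print([round(H[n][n], 10) for n in range(5)])
```

The diagonal elements printed for $n = 0, \ldots, N-2$ should reproduce $n + \tfrac12$, and the off-diagonal elements should vanish, in line with (39).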


Usually it is not possible to give an exact solution of a problem in quantum
mechanics; we have to restrict ourselves to approximate solutions of the
Schrödinger equation

$$i\hbar\frac{\partial}{\partial t}|\psi\rangle = (\hat{H}_0 + \hat{H}')|\psi\rangle . \tag{16.118}$$

The splitting $\hat{H} = \hat{H}_0 + \hat{H}'$ is chosen in such a manner that the
solutions of $\hat{H}_0$ are known:

$$\hat{H}_0|\varphi_n\rangle = E_n|\varphi_n\rangle , \tag{16.119}$$

and $\hat{H}'$ is small enough compared with $\hat{H}_0$, so that its influence may be
considered as a perturbation. In Chap. 11, the perturbation $\hat{H}'$ was denoted by
$\varepsilon\hat{W}$. We expand $|\psi\rangle$ in terms of $|\varphi_n\rangle$, and write

$$|\psi\rangle = \sum_n C_n(t)\exp\left(-\frac{i}{\hbar}E_n t\right)|\varphi_n\rangle . \tag{16.120}$$


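Inserting this into (16.118) and taking into account (16.119), the following
system of coupled differential equations for the expansion coefficients $C_n(t)$ is
obtained:

$$\frac{d}{dt}C_m(t) = -\frac{i}{\hbar}\sum_n\langle\varphi_m|\hat{H}'|\varphi_n\rangle\exp\left[\frac{i}{\hbar}(E_m - E_n)t\right]C_n(t) . \tag{16.121}$$

These equations may easily be transformed by integration into a system of
coupled integral equations:

$$C_m(t) = C_m(0) - \frac{i}{\hbar}\sum_n\int_0^t dt'\,\langle\varphi_m|\hat{H}'|\varphi_n\rangle\exp\left[\frac{i}{\hbar}(E_m - E_n)t'\right]C_n(t') . \tag{16.122}$$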


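Up to this point, everything is exact. Now approximations enter. If we assume
the system to be in state $|\varphi_i\rangle$ at time $t = 0$, we have

$$C_n(0) = \delta_{ni} . \tag{16.123}$$

As the perturbation $\hat{H}'$ is assumed to be small and of little influence on the
undisturbed states $|\varphi_n\rangle$, it is consistent to assume that none of the $C_n(t)$
coefficients differs considerably from the initial value. Further, assuming that $\hat{H}'$
is independent of time, it follows for $f \neq i$ that

$$\begin{aligned}
C_f(t) &= -\frac{i}{\hbar}\langle\varphi_f|\hat{H}'|\varphi_i\rangle\int_0^t dt'\,\exp\left[\frac{i}{\hbar}(E_f - E_i)t'\right] \\
&= -\frac{i}{\hbar}\langle\varphi_f|\hat{H}'|\varphi_i\rangle\,\frac{\exp[i(E_f - E_i)t/\hbar] - 1}{i(E_f - E_i)/\hbar} .
\end{aligned} \tag{16.124}$$

After time $t$, the probability of finding the system in the state $|\varphi_f\rangle$ is given by

$$|C_f(t)|^2 = \frac{4}{\hbar^2}\,|\langle\varphi_f|\hat{H}'|\varphi_i\rangle|^2\,\frac{\sin^2(\omega_{fi}t/2)}{\omega_{fi}^2} \tag{16.125}$$

with

$$\omega_{fi} = \frac{E_f - E_i}{\hbar} .$$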


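The function $\sin^2(\omega t/2)/\omega^2$ has a peak at $\omega = 0$ as a function of $\omega$, which
becomes sharper if $t$ increases. Moreover, we have

$$\int_{-\infty}^{\infty}\frac{\sin^2(\omega t/2)}{\omega^2}\,d\omega = \frac{\pi t}{2} \tag{16.126}$$

because

$$\int_{-\infty}^{\infty}\frac{\sin^2 x}{x^2}\,dx = \pi .$$

Clearly this means that

$$\lim_{t\to\infty}\frac{\sin^2(\omega t/2)}{\omega^2 t} = \frac{\pi}{2}\,\delta(\omega) . \tag{16.127}$$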
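
The growth of the peak area in proportion to $t$, as expressed by (16.126), can be checked with a few lines of numerical integration. This is only a sketch: the cutoff `W` and the number of steps `N` are arbitrary choices, and the midpoint rule neglects the tails beyond $|\omega| = W$, which contribute roughly $1/W$.

```python
import math


def peak_area(t, W=2000.0, N=400000):
    """Midpoint-rule estimate of the integral of sin^2(w t / 2) / w^2 over [-W, W]."""
    dw = 2.0 * W / N
    total = 0.0
    for j in range(N):
        w = -W + (j + 0.5) * dw  # midpoints never coincide with w = 0
        total += math.sin(0.5 * w * t) ** 2 / (w * w) * dw
    return total


if __name__ == "__main__":
    for t in (1.0, 2.0, 4.0):
        print(t, peak_area(t), math.pi * t / 2.0)
```

The two printed columns should agree to about three decimal places, and doubling $t$ doubles the area while the peak width shrinks like $1/t$.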


Inserting this result in (16.125), we obtain immediately the transition probability
per unit time from state $|\varphi_i\rangle$ to $|\varphi_f\rangle$:

$$\frac{|C_f(t)|^2}{t} = \frac{2\pi}{\hbar}\,|\langle\varphi_f|\hat{H}'|\varphi_i\rangle|^2\,\delta(E_f - E_i) . \tag{16.128}$$

This is Fermi’s golden rule. Here, the $\delta$ function expresses the conservation of
energy. The $\delta$ function is a distribution and as such its significance in a physical
formula is not quite clear. It disappears from (16.128) if we consider that in all
practical cases we have to integrate over a continuum from a lower to an upper
energy limit (cf. the extensive discussion in Chap. 11).

Now the general approximation scheme for solving the system of integral
equations (16.122) is evident. To obtain higher approximations, we have to
iterate as often as necessary. These calculations are tedious; however, their result
is simple enough to be presented here without proof. Generally the transition
probability per time for the transition $i \to f$ is given by

$$\left(\frac{\text{transition probability}}{\text{time}}\right)_{i\to f} = \frac{2\pi}{\hbar}\,|M_{fi}|^2\,\delta(E_f - E_i) . \tag{16.129}$$

Here, the transition matrix $M_{fi}$ reads:

$$M_{fi} = \langle f|\hat{H}'|i\rangle + \sum_{I}\frac{\langle f|\hat{H}'|I\rangle\langle I|\hat{H}'|i\rangle}{E_i - E_I + i\eta}
+ \sum_{I,II}\frac{\langle f|\hat{H}'|I\rangle\langle I|\hat{H}'|II\rangle\langle II|\hat{H}'|i\rangle}{(E_i - E_I + i\eta)(E_i - E_{II} + i\eta)} + \ldots \tag{16.130}$$

To simplify the formula, we have abbreviated $\langle\varphi_f|\hat{H}'|\varphi_i\rangle = \langle f|\hat{H}'|i\rangle$, etc.
The states I, II are intermediate states via which the higher-order transitions
proceed. The infinitesimal quantity $\eta$, which appears in the denominators, is
positive and indicates how the singularities in the expression $M_{fi}$ are to be
treated.⁷


7 See the extensive presentation of perturbation theory in Chap. 11 and, e.g., in A.S. 
Davydov: Quantum Mechanics, Chap. VI (Pergamon, Oxford 1965); L.I. Schiff: Quantum 
Mechanics, Chap. 8, 3rd ed. (McGraw-Hill, New York 1968); A. Messiah: Quantum 
Mechanics, Vol. II (North-Holland, Amsterdam 1965). 
17. Conceptual and Philosophical Problems
of Quantum Mechanics 


In the preceding chapters we have tried to develop the conceptual basis of 
quantum mechanics. In addition, various exercises have demonstrated how 
quantum theory can be used to solve real physical problems. Of course, these 
problems represent only a small fraction of the many applications of quantum 
mechanics; numerous experimental results involving physics and chemistry can 
be explained successfully by adopting this theory. Up to now, no prediction 
made by quantum mechanics has been disproved experimentally. In spite of this 
success, conceptual problems of the theory exist; in fact, there have been many 
attempts to interpret quantum mechanics in a new way or even to replace 
it by another theory based on a more obvious conceptual and philosophical 
foundation. 

Therefore, in this chapter¹, we will try to gain insight into the principal
conceptual difficulties of quantum mechanics, which we will illustrate. The most
important alternatives will be presented. Many questions occurring during this
discussion belong to the realm of opinion rather than to that of fact; therefore, 
many physicists assign these considerations to the field of philosophy. Never- 
theless, the basis of quantum mechanics is so important and fundamental to our 
perception of microscopic processes in nature, that every physicist should at 
least understand the nature of the principal conceptual problems and questions 
arising therefrom. 


17.1 Determinism 


Quantum mechanics is an indeterministic theory which states that there are
physical measurements whose results are not definitely determined by the state
of the system prior to the measurement (at least, as far as it is in principle
possible to observe that state). If, just before the measurement, the wave function 
of the system is not an eigenfunction of the operator whose observable is to be 


1 In conceiving this chapter, we were guided by similar discussions by A.I.M. Rae in his 
book: Quantum Mechanics (McGraw-Hill, London 1981); M. Jammer: The Philosophy of 
Quantum Mechanics: The Interpretations of Quantum Mechanics in Historical Perspective 
(Wiley, New York 1974). 
measured, then the result of the measurement is not definitely predictable; only
the probability of the various possible results can be determined. 

By way of example, recall the Stern-Gerlach experiment (Chap. 12). If a beam
of spin-½ particles goes through an experimental setup oriented in the z direc-
tion, the beam is split up according to the two possible directions of $s_z$. If the
beam consisting of $s_z = +\tfrac{1}{2}\hbar$ particles then travels through another apparatus
measuring the spin component in another direction, then the beam will split
up again into the two components $+\tfrac{1}{2}\hbar$ and $-\tfrac{1}{2}\hbar$. The probability of these two
results can be calculated, but one cannot predict whether a single particle will
have spin up or spin down.

The indeterminism of quantum mechanics is opposed to the determinism of 
classical mechanics. For the latter, the evolution of a system is definitely fixed by 
its initial state and by the forces acting on it. Even the final state of a “purely 
random” experiment, like the tossing of a coin, can be determined if the initial 
position, velocity and angular momentum of the coin, as well as the gravi- 
tational and frictional forces acting upon it, are known. 

The randomness of quantum mechanics, however, has a totally different 
character: results of measurements are not an unequivocal consequence of the 
previous state of the system. The fact that the spin-½ particles are in an eigenstate
of $\hat{s}_z$ fully determines the part of the wave functions attributed to the spin, and these
particles are identical. Nevertheless, they may act differently if another spin 
component is measured. The guiding-field interpretation introduced in Chap. 3 
permits only statements concerning probabilities of the individual behaviour of 
the particles moving in the guiding field. 

To circumvent the problem of indeterminism, it was suggested that particles 
in the same state just seem to be identical, and that in reality, they have 
additional, different properties which determine the result of a further experi-
ment. For instance, a particle in an eigenstate of $\hat{s}_z$ would also have a fixed value
of $s_x$, even if the latter value cannot be measured. Theories based on assumptions
like this are called hidden-variable theories; the properties of these theories will 
be discussed below. 


17.2 Locality 


We now turn our attention to a second problem of quantum mechanics, the 
so-called locality of the theory. When talking about particles, we assume them to 
be pointlike or at least significantly smaller than other typical dimensions of the 
system under consideration. Thus it makes good sense to assume that a particle 
“feels” the value of some field acting upon it only at its momentary position. 
Now we will discuss two examples in which that assumption proves incompat- 
ible with the outcome of the measurement. 

First we will look at an experiment with a parallel beam (e.g. a light ray or an
electron beam) falling through a double slit (see Fig. 17.1). At some distance from
the slit, a row of detectors registers the arrival of photons or electrons. After
a sufficiently long period of time, the number of particles counted by a single
detector is proportional to the intensity of a wave at that point, which has been
calculated by assuming that the single waves interfere. The wave model (guiding
field) as well as the particle model are necessary to explain this result: the
detector registers particles of a definite energy, but the interference pattern can
only be explained by the passage of the waves (of the guiding field) through
both slits.

Fig. 17.1. Double-slit experiment (schematic)

Now we alter the experimental setup by placing the detectors directly behind 
each slit. Then particles will be measured passing through one of the slits, but 
not through both at the same time. But the appearance of an interference pattern 
necessarily requires that a wave pass through both slits at once; this can be 
shown by shutting one of the slits during the previous experiment. Therefore the 
probability of observing a particle at a certain point depends upon whether 
only one or both slits are open. We could conclude that the “object” (photon or 
electron) acts only in some situations as a particle (and in other situations, 
as a wave), or that it is influenced by a slit through which it does not pass. The 
second assumption implies a kind of nonlocal interaction. 

A further remark: this difficulty in interpreting the experiment can be 
avoided by applying the concept of a guiding field $\psi$, which leads the single particles
statistically to the various space-time points according to $|\psi(x, y, z, t)|^2$.

One further example of nonlocal consequences in quantum mechanics is
given by the behaviour of particles separated in space, whose properties are
correlated in some way. A system of this kind was first discussed by Einstein,
Podolsky and Rosen². Here we will develop a Gedankenexperiment, conceived
by Bohm³, and look at a pair of spin-½ particles with total spin zero, each having
zero orbital angular momentum. Pairs with these properties may be created by
the scattering of a beam of low-energy protons from hydrogen gas. The incom-
ing and target protons then separate in a state with vanishing orbital angular
momentum and vanishing total spin. When the protons are far apart, we measure, for
instance, the z component of the spin of the first particle ($s_{z1}$) and, afterwards, the
same component of the second particle ($s_{z2}$). Since the total spin is zero, the
components must have opposite sign, e.g. $s_{z1} = +\tfrac{1}{2}\hbar$, $s_{z2} = -\tfrac{1}{2}\hbar$. Thus, in principle,
the second measurement is not necessary, as the value of $s_{z2}$ can be deduced
from the value of $s_{z1}$. After the measurement of $s_{z1}$, the second particle is in an
eigenstate of $\hat{s}_{z2}$. This also means a change in the probabilities of the measure-
ment of another spin component of the second particle [see the more detailed
discussion below, following (17.1)].


2 A. Einstein, B. Podolsky, N. Rosen: Phys. Rev. 47, 777 (1935).
3 D. Bohm: Quantum Theory (Prentice-Hall, Englewood Cliffs, NJ 1951) pp. 614-622.

So the result of the measurement of the second particle is influenced by the 
measurement of the spin of the first particle, although both particles are far 
apart and do not interact with each other. According to quantum mechanics, 
two particles in a state of this kind are not independent of each other. If they are 
nevertheless considered to be independent, then a hidden variable theory must 
be introduced. 

Attempts have been made to develop a theory conserving determinism and 
locality that could reproduce all the experimentally proved results of quantum 
mechanics. Only recently has it become clear that this is not possible. This will 
be elucidated in the following section. 


17.3  Hidden-Variable Theories 


The aim of a hidden-variable theory is to consider quantum mechanics as 
a statistical theory in the sense that it furnishes probabilities of possible events,
which in reality are fixed by nonobservable properties. This is in analogy to 
classical statistical mechanics, which incorporates random fluctuations as a prop- 
erty of thermodynamical systems, although a discussion of individual atoms 
should yield completely deterministic behaviour. It would be advantageous 
to find similar substructures for quantum mechanics to preserve locality and 
determinism. Of course, a theory including hidden variables has to reproduce all 
the experimentally confirmed results of quantum mechanics. On the other hand, 
we could decide which theory is better if new experiments were performed, for 
which the theories predict different results. 

One such experiment, for which quantum mechanics and the hidden-
variable theory give different results, is Bohm’s Gedankenexperiment. To prove
a discrepancy, we first have to derive the quantitative predictions of quantum
mechanics for measuring the spin component $s_{\phi 2}$ of the second particle at an
angle $\phi$ to the z axis, having previously determined the spin component $s_{z1}$ of
the first particle. If the first measurement results in the value $+\tfrac{1}{2}\hbar$, $s_{z2}$ neces-
sarily has to be negative. The spin part of the wave function of the second par-
ticle therefore is (compare Chap. 12):

$$\chi_- = \begin{pmatrix} 0 \\ 1 \end{pmatrix} . \tag{17.1}$$

The operator $\hat{s}_\phi$, standing for the spin component at an angle $\phi$ to the z axis, is
given by

$$\hat{s}_\phi = \hat{s}_z\cos\phi + \hat{s}_x\sin\phi = \frac{\hbar}{2}\begin{pmatrix} \cos\phi & \sin\phi \\ \sin\phi & -\cos\phi \end{pmatrix} . \tag{17.2}$$

$\hat{s}_\phi$ has the eigenvalues $+\hbar/2$ and $-\hbar/2$, and the corresponding eigenvectors are
easily determined as $\begin{pmatrix} \cos(\phi/2) \\ \sin(\phi/2) \end{pmatrix}$ and $\begin{pmatrix} -\sin(\phi/2) \\ \cos(\phi/2) \end{pmatrix}$. We expand the wave function $\chi_-$ as
a linear combination of these two eigenvectors, which gives:

$$\begin{pmatrix} 0 \\ 1 \end{pmatrix} = \sin(\phi/2)\begin{pmatrix} \cos(\phi/2) \\ \sin(\phi/2) \end{pmatrix} + \cos(\phi/2)\begin{pmatrix} -\sin(\phi/2) \\ \cos(\phi/2) \end{pmatrix} . \tag{17.3}$$


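The probability that the second measurement will result in a positive value is
therefore $P_{++}(\phi) = \sin^2(\phi/2)$. In analogy, we define the probabilities $P_{+-}$, $P_{-+}$,
$P_{--}$ for the various possible results of both experiments. Similar considerations
lead to

$$\begin{aligned}
P_{++}(\phi) &= \sin^2(\phi/2) , & P_{+-}(\phi) &= \cos^2(\phi/2) , \\
P_{-+}(\phi) &= \cos^2(\phi/2) , & P_{--}(\phi) &= \sin^2(\phi/2) .
\end{aligned} \tag{17.4}$$

It is useful to introduce the correlation coefficient $C(\phi)$, defined as the average
value of the product $s_{z1}s_{\phi 2}$, averaged over a great number of measurements of
such pairs of particles. The probabilities (17.4) refer to a given result of the first
measurement, and each of the two results $\pm\hbar/2$ of the first measurement occurs
with probability ½. Then,

$$\begin{aligned}
C(\phi) &= \frac{\hbar^2}{8}\left[P_{++}(\phi) + P_{--}(\phi) - P_{+-}(\phi) - P_{-+}(\phi)\right] \\
&= \frac{\hbar^2}{4}\left[\sin^2(\phi/2) - \cos^2(\phi/2)\right] = -\frac{\hbar^2}{4}\cos\phi .
\end{aligned} \tag{17.5}$$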
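
The correlation coefficient can also be obtained directly as an expectation value in the two-particle singlet state, without going through the conditional probabilities. The following sketch is an illustration under stated assumptions: it writes the singlet as $(|\uparrow\downarrow\rangle - |\downarrow\uparrow\rangle)/\sqrt{2}$, measures spins in units of $\hbar/2$ (so the result is in units of $\hbar^2/4$), and uses the real matrix of (17.2). The function names are hypothetical.

```python
import math


def correlation_qm(phi):
    """<s_z1 s_phi2> in the singlet state, in units of hbar^2 / 4."""
    sz = [[1.0, 0.0], [0.0, -1.0]]
    sphi = [[math.cos(phi), math.sin(phi)],
            [math.sin(phi), -math.cos(phi)]]  # matrix of (17.2) without hbar/2
    r = 1.0 / math.sqrt(2.0)
    # amplitudes psi[a][b]: a = particle 1, b = particle 2 (0 = up, 1 = down)
    psi = [[0.0, r], [-r, 0.0]]
    total = 0.0
    for a in range(2):
        for b in range(2):
            for c in range(2):
                for d in range(2):
                    total += psi[a][b] * sz[a][c] * sphi[b][d] * psi[c][d]
    return total


def correlation_from_probabilities(phi):
    """Same quantity built from the conditional probabilities (17.4)."""
    p_pp = p_mm = math.sin(phi / 2.0) ** 2
    p_pm = p_mp = math.cos(phi / 2.0) ** 2
    # each result of the first measurement occurs with probability 1/2
    return 0.5 * (p_pp + p_mm - p_pm - p_mp)


if __name__ == "__main__":
    for phi in (0.0, math.pi / 3, math.pi / 2, math.pi):
        print(round(phi, 3), correlation_qm(phi), correlation_from_probabilities(phi))
```

Both routes should give $-\cos\phi$ in these units, which is (17.5).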
Now we will look at an example of a hidden-variable theory which assumes 
all spin components (only one being determinable at a given time) to have fixed, 
but unknown, values. This means that a “real” spin vector, analogous to 
a classical angular-momentum vector, should exist. We choose as our example 
the two-particle scattering for a state with vanishing total spin, which has 
already been explained. For total spin zero, it follows naturally that these “real” 
spin vectors of the two particles should have the same value but opposite 
direction, when the particles move away from each other. In quantum mechan-
ics, only one spin component of a particle can be determined at a given time. The
only possible results for that component are $\pm\hbar/2$. The other components,
which are not measurable, may be thought of as hidden variables. We assume 
they exist, but it is not possible to determine them at the same time as the 
previously chosen component (which can be any of the three components). 

We denote the “real” spin vector of a particle by $\mathbf{s}$. This spin vector $\mathbf{s}$ has to
interact with the experimental apparatus in such a way as to ensure that the
value of a spin component is always measured as $\pm\hbar/2$, although the corres-
ponding component of $\mathbf{s}$ may have another value. This may be achieved by
requiring that if $\mathbf{s}$ has a positive z component, its measured value must always
be $+\hbar/2$, and that for a negative z component, the result must be $-\hbar/2$. In
general, this should be valid for any direction.

Fig. 17.2. A “real” spin vector $\mathbf{s}$. It is postulated that a measurement in the
z direction yields the result $\hbar/2$, although the absolute value of the “real” spin
component $s_z$ is smaller. A measurement in the x direction yields $-\hbar/2$. This
postulate has to be introduced in order to preserve the theoretically established
(quantum-mechanical) results of the measurement of a spin component

Fig. 17.3. According to the theory with the hidden variables $s_x$ and $s_y$, based on
the concept of a “real” spin vector, the number of particle pairs that have a positive
$s_{z1}$ component of the first particle and a positive $s_{\phi 2}$ component of the second
particle, is proportional to the hatched area. Two “real” spin vectors $\mathbf{s}_1$ and $\mathbf{s}_2$,
fulfilling these conditions, are shown

The point here should be clear: the “real” spin vector of a particle does not 
have to be parallel to the z axis if we measure +h/2 as the result for the 
z component; it should only be somewhere in the upper hemisphere, with the 
z axis being the symmetry axis (see Fig. 17.2). 

The assumption that each individual particle has a well-defined but not 
measurable spin vector s is the basic postulate of the theory of hidden variables: 
if $s_z$ is measured, the components $s_x$ and $s_y$ also exist, constituting together with
$s_z$ the total “real” spin. This concept of “real spin” can only be made compatible
with experiments by artificially introducing unjustifiable prescriptions for the 
result of a measurement. 

In the following we will show that this leads to a correlation function $C'(\phi)$ of
the spin components of both particles, differing from the quantum-mechanical
result $C(\phi)$ of (17.5). If we get the value $s_{z1} = +\hbar/2$ for the first particle in our
hypothetical experiment, the z component of the second particle’s spin has to be
negative. The “real” spin vector of the second particle is therefore placed in the
lower hemisphere, with the negative z axis as symmetry axis (see Fig. 17.3). If, in
addition, the component $s_{\phi 2}$ of the second particle’s spin along a direction at an
angle $\phi$ to the z axis is positive, the spin vector of the second particle has to be in
the hemisphere with the symmetry axis inclined by $\phi$ with respect to the z axis.
Assuming the orientation of the spin vectors to be homogeneously distributed in
space, the probability $P'_{++}(\phi)$ for $s_{z1}$ and $s_{\phi 2}$ being positive at the same time has to be
proportional to the overlapping volume of both hemispheres (hatched part of
Fig. 17.3).

The overlap is proportional to $\phi$ for $0 \leq \phi \leq \pi$. For angles between $\pi$ and
$2\pi$, the angle $\phi$ has to be replaced by $2\pi - \phi$ in the following formula. By taking
into account that $P'_{++}(\pi) = 1$, it follows that $P'_{++}(\phi) = \phi/\pi$. The remaining
probabilities can be derived by analogous considerations:

$$\begin{aligned}
P'_{++}(\phi) &= \phi/\pi , & P'_{+-}(\phi) &= 1 - \phi/\pi , \\
P'_{-+}(\phi) &= 1 - \phi/\pi , & P'_{--}(\phi) &= \phi/\pi .
\end{aligned} \tag{17.6}$$

Thus, for a hidden-variable theory, the correlation coefficient $C'(\phi)$, defined by
the average value of the product of both measured spin components, is

$$C'(\phi) = \frac{\hbar^2}{4}\left(\frac{2\phi}{\pi} - 1\right) . \tag{17.7}$$


This result coincides with the quantum-mechanical formula (17.5) only for
$\phi = 0$, $\phi = \pi/2$, or $\phi = \pi$. For different orientations of the experimental appa-
ratus, the results differ considerably (see Fig. 17.4).
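
The linear dependence of $C'(\phi)$ on $\phi$ can be illustrated by a small Monte Carlo simulation of the “real” spin vector model. The sketch below makes the assumptions explicit: spin directions are drawn isotropically (via three Gaussian components), the second particle carries the opposite vector, each apparatus reports only the sign of the component along its axis, and the second axis lies in the x-z plane. Results are in units of $\hbar^2/4$; the function name, the number of pairs and the seed are arbitrary choices.

```python
import math
import random


def correlation_hidden(phi, pairs=100000, seed=1):
    """Monte Carlo estimate of C'(phi) in units of hbar^2 / 4."""
    rng = random.Random(seed)
    nx, nz = math.sin(phi), math.cos(phi)  # axis of the second apparatus
    total = 0
    for _ in range(pairs):
        # isotropic direction of the "real" spin of particle 1 (length irrelevant)
        x, y, z = rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)
        s_z1 = 1 if z > 0 else -1
        # particle 2 carries the opposite vector; only the sign is measured
        s_phi2 = 1 if -(x * nx + z * nz) > 0 else -1
        total += s_z1 * s_phi2
    return total / pairs


if __name__ == "__main__":
    for phi in (0.0, math.pi / 3, math.pi / 2, math.pi):
        print(round(phi, 3), correlation_hidden(phi), 2.0 * phi / math.pi - 1.0)
```

Within the statistical error the estimate follows $2\phi/\pi - 1$, i.e. (17.7), and therefore differs from $-\cos\phi$ except at $\phi = 0$, $\pi/2$ and $\pi$.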

Thus we have shown a discrepancy between the predictions of quantum
mechanics and the results of a hidden-variable theory, assuming that the single
particles have “real” spin vectors. In the following, we will demonstrate as well
that any local deterministic hidden-variable theory will end up predicting
experimental results that contradict quantum mechanics. The first proof of this
important statement was given by Bell,⁴ and is thus called Bell’s theorem.


17.4 Bell’s Theorem 


To succeed with a general proof, we first have to state the minimum require- 
ments of a local deterministic hidden-variable theory for the case of Bohm’s 
hypothetical experiment. After the two particles have separated, the system 
should have a property that determines in advance the result of measuring any 
spin component of any particle. In the previous example, that property was 
granted by the existence of a “real” spin vector, but for the general case a hidden 
variable does not have to correspond with a parameter of a special physical 
model. 

The result of a measurement of the first particle’s z component of spin
is designated as $s_{z1}(\lambda)$, where the only values allowed for $s_{z1}$ are $\pm\tfrac{1}{2}\hbar$, and
$\lambda$ represents the hidden variable (of course, $\lambda$ may be replaced by multiple hidden
variables; the following discussion can easily be extended to that case). A theory
of this kind is deterministic, since the values of $s_{z1}$ and $s_{\phi 2}$ are given by the value
of $\lambda$; the theory is local, because the result of a measurement on one particle does
not depend on which measurement is performed on the other particle. Every particle
pair has a definite value of $\lambda$, and we define the probability density $\rho(\lambda)$, giving
the probability of a pair with a value between $\lambda$ and $\lambda + d\lambda$, as $\rho(\lambda)\,d\lambda$. The
normalization condition is

$$\int\rho(\lambda)\,d\lambda = 1 . \tag{17.8}$$

Now we consider an experiment determining the spin components $s_{z1}$ and
$s_{\phi 2}$ of a large number of pairs. The average value $C''(\phi)$ of the products $s_{z1}s_{\phi 2}$
follows as

$$C''(\phi) = \int s_{z1}(\lambda)\,s_{\phi 2}(\lambda)\,\rho(\lambda)\,d\lambda . \tag{17.9}$$

Let us look at a second experiment measuring, as before, the z component of the
first particle, but where the second apparatus is placed at an angle $\theta$ to the z axis.
We get an analogous expression for the correlation coefficient $C''(\theta)$. It is

$$C''(\phi) - C''(\theta) = \int\left[s_{z1}(\lambda)\,s_{\phi 2}(\lambda) - s_{z1}(\lambda)\,s_{\theta 2}(\lambda)\right]\rho(\lambda)\,d\lambda . \tag{17.10}$$

We know the spins of both particles to be of equal magnitude but opposite
direction. Thus it follows that

$$s_{z1}(\lambda) = -s_{z2}(\lambda) , \qquad s_{\phi 1}(\lambda) = -s_{\phi 2}(\lambda) . \tag{17.11}$$


4 J.S. Bell: Physics 1, 195 (1965). The following synoptic article can be recommended:
J.F. Clauser, A. Shimony: “Bell’s Theorem: Experimental Tests and Implications”, Rep.
Prog. Phys. 41, 1881 (1978). We also mention a more popular article by B. d’Espagnat: 
“The Quantum Theory and Reality”, Scientific American Vol. 241, No. M5128 (1979), 
Fig. 17.4. The quantum-mechanical correlation function $C(\phi)$ and the
correlation function $C'(\phi)$ of a hidden-variable theory (based on the existence of
a “real” spin vector) are considerably different

Fig. 17.5. The solid line shows the function $|C(\phi) - C(2\phi)| - C(\phi)$ according to
quantum mechanics; the dashed line represents the result of the same function for
the hidden-variable theory containing “real” spin

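Inserting (17.11) into (17.10) yields

$$\begin{aligned}
C''(\phi) - C''(\theta) &= -\int s_{z1}(\lambda)\left[s_{\phi 1}(\lambda) - s_{\theta 1}(\lambda)\right]\rho(\lambda)\,d\lambda \\
&= -\int s_{z1}(\lambda)\,s_{\phi 1}(\lambda)\left[1 - \frac{4}{\hbar^2}\,s_{\phi 1}(\lambda)\,s_{\theta 1}(\lambda)\right]\rho(\lambda)\,d\lambda ,
\end{aligned} \tag{17.12}$$

where we have used $[s_{\phi 1}(\lambda)]^2 = \tfrac{1}{4}\hbar^2$. The magnitude of (17.12) can be esti-
mated by

$$|C''(\phi) - C''(\theta)| \leq \int\left|s_{z1}(\lambda)\,s_{\phi 1}(\lambda)\right|\left[1 - \frac{4}{\hbar^2}\,s_{\phi 1}(\lambda)\,s_{\theta 1}(\lambda)\right]\rho(\lambda)\,d\lambda . \tag{17.13}$$

Since the term $\rho(\lambda)$ can never be negative and the spins only have values $\pm\hbar/2$
(thus $|s_{z1}(\lambda)\,s_{\phi 1}(\lambda)| = \hbar^2/4$), it follows by applying (17.8) and (17.11) that

$$\begin{aligned}
|C''(\phi) - C''(\theta)| &\leq \int\left[\frac{\hbar^2}{4} - s_{\phi 1}(\lambda)\,s_{\theta 1}(\lambda)\right]\rho(\lambda)\,d\lambda \\
&= \frac{\hbar^2}{4} + \int s_{\phi 1}(\lambda)\,s_{\theta 2}(\lambda)\,\rho(\lambda)\,d\lambda .
\end{aligned} \tag{17.14}$$
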
We will restrict our consideration to cases where the z axis and the directions
given by $\theta$ and $\phi$ are in one plane. The mean values defined by (17.9) only
depend on the relative orientation of the single measurements, not on the
absolute direction of the z axis (this may be proved by additional experiments).
Therefore we can replace the integral in (17.14) by the correlation function
$C''(\theta - \phi)$; we get

$$|C''(\phi) - C''(\theta)| \leq \frac{\hbar^2}{4} + C''(\theta - \phi) . \tag{17.15}$$

This inequality is called Bell’s theorem. It is a direct consequence of every local
deterministic hidden-variable theory. We now have to establish whether this
inequality coincides with the predictions of quantum mechanics. To do so, we
look at the special case $\theta = 2\phi$. The correlation functions can be derived
with the help of (17.5):

$$C(\phi) = C(\theta - \phi) = -\frac{\hbar^2}{4}\cos\phi , \qquad C(\theta) = -\frac{\hbar^2}{4}\cos 2\phi . \tag{17.16}$$


Comparing (17.16) with (17.15), we find that quantum mechanics is only consis-
tent with a hidden-variable theory if $\tfrac{1}{4}\hbar^2(|\cos\phi - \cos 2\phi| + \cos\phi) \leq \tfrac{1}{4}\hbar^2$.

Figure 17.5 shows Bell’s theorem to be satisfied for $\tfrac{1}{2}\pi \leq \phi \leq \pi$, but not for
$0 < \phi < \tfrac{1}{2}\pi$. The function has a maximum when $\phi = \pi/3$ with a value of $3\hbar^2/8$.
In addition, the figure shows the function (dashed lines) calculated by applying
the previously discussed theory of “real” spin vectors (17.7). Of course, this is
consistent with Bell’s theorem and inconsistent with quantum mechanics.
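
The comparison shown in Fig. 17.5 can be reproduced by evaluating the combination $|C(\phi) - C(2\phi)| - C(\phi)$ for both correlation functions. The sketch below works in units of $\hbar^2/4$, so that Bell's inequality (17.15) with $\theta = 2\phi$ requires the result not to exceed 1. It assumes the quantum-mechanical $C(\phi)$ of (17.5) and the “real” spin result (17.7), extended to angles between $\pi$ and $2\pi$ by replacing the angle with $2\pi$ minus the angle, as stated in the text; the function names are illustrative.

```python
import math


def c_qm(angle):
    """Quantum-mechanical correlation (17.5) in units of hbar^2 / 4."""
    return -math.cos(angle)


def c_hidden(angle):
    """'Real' spin correlation (17.7); angles above pi are mapped to 2 pi - angle."""
    a = angle % (2.0 * math.pi)
    if a > math.pi:
        a = 2.0 * math.pi - a
    return 2.0 * a / math.pi - 1.0


def bell_lhs(c, phi):
    """|C(phi) - C(2 phi)| - C(phi); Bell's inequality demands a value <= 1."""
    return abs(c(phi) - c(2.0 * phi)) - c(phi)


def bell_lhs_qm(phi):
    return bell_lhs(c_qm, phi)


def bell_lhs_hidden(phi):
    return bell_lhs(c_hidden, phi)


if __name__ == "__main__":
    for k in range(0, 7):
        phi = k * math.pi / 6.0
        print(round(phi, 3), round(bell_lhs_qm(phi), 4), round(bell_lhs_hidden(phi), 4))
```

The quantum-mechanical values exceed 1 for $0 < \phi < \pi/2$, with the largest value 3/2 at $\phi = \pi/3$ (that is, $3\hbar^2/8$), while the hidden-variable values never exceed 1.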

It is remarkable that Bell’s inequality is violated by quantum mechanics in that
range of parameters in which the difference between the results of a hidden-variable
theory and quantum mechanics is maximal.




Owing to these considerations, we have to conclude that no local determin- 
istic hidden-variable theory can reproduce the results attained by quantum mecha- 
nics for experiments of this kind. 

As natural scientists, we have to determine which theory is correct, i.e. which 
theory is consistent with nature. Thus experiments have to show which of the 
two theories yields better predictions. At first, one might think this unnecessary, 
since quantum mechanics has become so well-established on the basis of 
experimental evidence. But experiments performed before the formulation of 
Bell’s theorem did not check the consistency of quantum mechanics regarding 
this special point. In recent years, several experiments testing this point have 
been carried out. Although the first experiments yielded results consistent with 
Bell’s theorem (leading to doubts about the validity of quantum mechanics), the 
following more accurate experiments yielded results inconsistent with Bell’s 
theorem but in accord with quantum-mechanical predictions. Most of these 
experiments measured the polarization of correlated photon pairs, in which case 
the formulation of Bell’s theorem is slightly different from (17.15).⁵ (For our
discussion, these minor differences yield no new information.)

Many of the difficulties concerning the nonlocal indeterministic character 
of quantum mechanics can be avoided by the Copenhagen interpretation, which 
treats the experimental apparatus as a part of the total quantum-mechanical 
system (see next section). As a variation of that interpretation, an additional 
formulation may be made by constructing a hidden-variable theory that also 
treats both correlated particles and apparatus as one system. In fact it is possible 
to construct such a nonlocal but deterministic theory, which reproduces the 
results of quantum mechanics. Here, the behaviour of the measuring apparatus 
at one point has to be influenced by another one, possibly placed far away at 
another point. This assumption proposes the existence of a new, up to now 
unknown, mechanism mediating the influence between the two apparatuses. 
A postulate of this kind seems to be unnatural, and, furthermore, the conceptual 
difficulties of this hidden-variable theory seem to be at least as extreme as in 
the case of quantum mechanics. Thus there is no reason to replace quantum 
mechanics by another theory. 

Hence, many physicists believe indeterminism to be a property of physical 
phenomena. Our common conception of causality results from our experience in 
the macroscopic world and cannot be translated directly to microscopic pro- 
cesses, where quantum-mechanical effects occur. There is no reason why there 
should not be some randomness in nature. If different causes can lead to the same 
effects (there are many examples of this in physics), it does not seem to be 
unreasonable that one cause could lead to different effects. The image of a guid- 
ing field leading its quanta according to the resulting probabilities seems to be 
a convincing concept. But, indeed, it is only a concept allowing predictions of 
expectation values, i.e. of results of many measurements. 


5 Further information is given in the article by Clauser and Shimony cited above (see 
footnote 4). A new experiment was done by W. Perrie, A.J. Duncan, H.J. Beyer, and 
H. Kleinpoppen, and reported on in Phys. Rev. Lett. 54, 1790 (1985). 
17.5 Measurement Theory


In this section we want to discuss in more detail what the measurement of
quantum-mechanical quantities really means. Let us consider a system described
by a wave function $\psi(x)$. We want to measure a variable $q$ represented by
the Hermitian operator $\hat{Q}$. The eigenvalues $q_n$ of $\hat{Q}$ are the possible results of
measuring that variable. The wave function may be expanded in terms of the set
of eigenfunctions $\phi_n(x)$ of $\hat{Q}$:

$$\psi(x) = \sum_n a_n \phi_n(x) . \qquad (17.17)$$

Therefore the expectation value of $\hat{Q}$ is

$$\langle Q \rangle = \int \psi^* \hat{Q} \psi \, dx = \int \Big(\sum_m a_m^* \phi_m^*\Big) \hat{Q} \Big(\sum_n a_n \phi_n\Big) dx$$

$$= \sum_{m,n} a_m^* a_n \int \phi_m^* \hat{Q} \phi_n \, dx = \sum_{m,n} a_m^* a_n q_n \int \phi_m^* \phi_n \, dx = \sum_n |a_n|^2 q_n , \qquad (17.18)$$

where we have used the orthogonality relation of the eigenfunctions $\phi_n$. For
a single measurement, the probability of a result $q_n$ is given by $|a_n|^2$.
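
As a concrete instance of the expansion in eigenfunctions and of the resulting expectation value, consider a two-level system. The sketch below is only an illustration under stated assumptions: $\hbar = 1$, and the measured operator is taken to be the x component of a spin-½, written as a matrix in the basis of $s_z$ eigenvectors. The helper names are hypothetical.

```python
import math

# Assumption: spin 1/2 with hbar = 1; Q is the x component of the spin
# written in the basis of s_z eigenvectors.
Q = [[0.0, 0.5],
     [0.5, 0.0]]

# Eigenvalues q_n and normalized eigenvectors phi_n of Q.
EIGENVALUES = [0.5, -0.5]
EIGENVECTORS = [
    [1.0 / math.sqrt(2.0), 1.0 / math.sqrt(2.0)],
    [1.0 / math.sqrt(2.0), -1.0 / math.sqrt(2.0)],
]


def inner(u, v):
    """Scalar product <u|v> = sum_i conj(u_i) v_i."""
    return sum(complex(a).conjugate() * b for a, b in zip(u, v))


def apply_matrix(matrix, v):
    """Matrix-vector product."""
    return [sum(row[j] * v[j] for j in range(len(v))) for row in matrix]


def coefficients(psi):
    """Expansion coefficients a_n = <phi_n|psi>."""
    return [inner(phi, psi) for phi in EIGENVECTORS]


def probabilities(psi):
    """Probabilities |a_n|^2 of the results q_n."""
    return [abs(a) ** 2 for a in coefficients(psi)]


def expectation_from_spectrum(psi):
    """Expectation value as sum_n |a_n|^2 q_n."""
    return sum(p * q for p, q in zip(probabilities(psi), EIGENVALUES))


def expectation_direct(psi):
    """Expectation value as <psi|Q|psi>."""
    return inner(psi, apply_matrix(Q, psi)).real


if __name__ == "__main__":
    psi = [0.6, 0.8]  # normalized example state
    print(probabilities(psi))
    print(expectation_from_spectrum(psi), expectation_direct(psi))
```

Both ways of computing the expectation value agree, as the derivation requires.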

Now we ask how measurement influences the system. Let the measurement
have the result $q_m$. Immediately after the measurement, we assume it to be
repeated. If the measurements are to make physical sense, we have to require that
the experimental result does not change: the second measurement also has to
result in the value $q_m$. Since this value is assumed to result with certainty, the
probabilities of the various results measured in the second experiment are

$$|a_n|^2 = \delta_{nm} .$$

Thus it follows that before the second measurement the system’s wave function
was $\phi_m$. We may say: the wave function of the system in the beginning, given
by $\psi$, has changed because of the measurement. The system’s state after the
measurement is given by $\phi_m$:

$$\psi \longrightarrow \phi_m \quad \text{by the measurement of } Q \text{ with the result } q_m .$$


That change of the wave function caused by a measurement is called wave- 
function reduction. The main problem of a measurement theory is to establish at 
what point in time this reduction takes place. 
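
The reduction postulate can be mimicked in a small simulation. The following sketch is an illustration only, assuming a two-level system with real eigenvectors (the x component of a spin-½ with $\hbar = 1$); the functions `measure` and `repeated_results` are hypothetical names. A result is drawn with the probabilities given by the squared expansion coefficients, and the state is then replaced by the corresponding eigenvector.

```python
import math
import random

SQRT_HALF = 1.0 / math.sqrt(2.0)
EIGENVALUES = [0.5, -0.5]
# Real eigenvectors, so no complex conjugation is needed in the overlaps.
EIGENVECTORS = [[SQRT_HALF, SQRT_HALF], [SQRT_HALF, -SQRT_HALF]]


def measure(psi, rng):
    """Return (result, reduced state) for one simulated measurement."""
    amplitudes = [sum(p * c for p, c in zip(phi, psi)) for phi in EIGENVECTORS]
    weights = [abs(a) ** 2 for a in amplitudes]
    m = rng.choices(range(len(weights)), weights=weights)[0]
    return EIGENVALUES[m], list(EIGENVECTORS[m])


def repeated_results(psi, repeats, seed=0):
    """Measure once, then repeat the measurement on the reduced state."""
    rng = random.Random(seed)
    results = []
    state = psi
    for _ in range(repeats):
        value, state = measure(state, rng)
        results.append(value)
    return results


if __name__ == "__main__":
    # The first result is random; all later ones repeat it.
    print(repeated_results([1.0, 0.0], 5, seed=1))
    print(repeated_results([1.0, 0.0], 5, seed=2))
```

Whatever the first result is, every immediate repetition reproduces it, which is the requirement from which the reduction was inferred above.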

For instance, if a particle moves through one slit of a double-slit experiment, 
this is not a measurement of the particle’s position, and thus the wave function 
is not reduced to an eigenfunction of the position operator unless we observe 
which of the two slits the particle moves through. Some physicists interpret 
this to mean that a quantum-mechanical measurement requires the presence of 
a human observer. Another interpretation proposes that the wave function is 
reduced when the experimental result is registered by an apparatus. On the
other hand, such an apparatus must itself be describable by a (naturally very
complicated) wave function. The question then arises of when that wave
function is reduced.

This problem may be circumvented by requiring macroscopic objects, for
instance particle detectors, to follow the laws of classical mechanics exactly. This
idea has the serious disadvantage that the boundary between the classical and
the quantum-mechanical regions is unclear. In addition, there are no experimental
facts indicating the existence of such a boundary.

Now let us discuss the problem of wave-function reduction by considering
the Stern-Gerlach experiment. We have an apparatus oriented in the z direction
(abbreviated by SGz), and a beam of spin-½ particles splitting into two parts
according to the two eigenvalues $\pm\frac{1}{2}\hbar$ of $s_z$. The corresponding eigenvectors are
$Z_+$ and $Z_-$. Before the experiment, the wave function is given by

$$X_+ = \frac{Z_+ + Z_-}{\sqrt{2}} . \qquad (17.19)$$

The experiment is illustrated schematically in Fig. 17.6.

The measurement, and thus the reduction of the wave function, is not
brought about, however, merely by the particle moving through a Stern-Gerlach
magnet. To understand this, we look at the following experiment (Fig. 17.7):

The particle beam with $s_x = +\frac{1}{2}\hbar$ moves through an apparatus in the
z direction, where it is split up. The particle beams are reunited so that it is
impossible to say which path was taken by which particle. Thus no information
on the value of $s_z$ has been gained. If there is no phase difference between the two
paths, the second measurement made in the x direction will yield the same result:
the wave function was not reduced (filtered) to $Z_+$ or $Z_-$; it is still $X_+$. This
consideration shows us that for measuring a spin component a detector has to
be present in the path the particle has chosen. This may be done by filtering out
one of the two beams behind the SGz apparatus. This filtering out is then, so to
speak, the detector just mentioned. But now a reduction of the wave function
takes place, caused by the existence of this second detector.
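
The difference between reuniting the beams and filtering one of them out can be made explicit with two-component spinors. The sketch below is an illustration under stated assumptions: spinors are written in the basis of $s_z$ eigenvectors, the relative phase between the two paths is a free parameter introduced here for illustration, and the function names are hypothetical.

```python
import cmath
import math

SQRT_HALF = 1.0 / math.sqrt(2.0)
Z_PLUS, Z_MINUS = [1.0, 0.0], [0.0, 1.0]   # eigenvectors of s_z
X_PLUS = [SQRT_HALF, SQRT_HALF]            # eigenvector of s_x with s_x = +hbar/2


def amplitude(bra, ket):
    """Scalar product <bra|ket>."""
    return sum(complex(b).conjugate() * k for b, k in zip(bra, ket))


def recombine(psi, phase=0.0):
    """Split into the two s_z beams and reunite them coherently.

    `phase` is a hypothetical relative phase picked up on the lower path.
    """
    a_plus, a_minus = amplitude(Z_PLUS, psi), amplitude(Z_MINUS, psi)
    return [a_plus, a_minus * cmath.exp(1j * phase)]


def block_lower_beam(psi):
    """Absorb the lower beam; return (transmission probability, normalized state).

    Assumes the upper-beam amplitude is nonzero.
    """
    a_plus = amplitude(Z_PLUS, psi)
    return abs(a_plus) ** 2, [a_plus / abs(a_plus), 0.0]


def prob_x_plus(psi):
    """Probability of finding s_x = +hbar/2 in a subsequent SGx measurement."""
    return abs(amplitude(X_PLUS, psi)) ** 2


if __name__ == "__main__":
    print(prob_x_plus(recombine(X_PLUS)))            # beams reunited, no phase shift
    print(prob_x_plus(block_lower_beam(X_PLUS)[1]))  # one beam filtered out
```

With the beams reunited and no phase difference, the SGx measurement reproduces the original result with certainty; after filtering, both SGx results become equally likely, which is the signature of the reduced wave function.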

Fig. 17.6. A Stern-Gerlach experiment (schematic)

Fig. 17.7. Experimental setup for multiple Stern-Gerlach measurements

To develop a complete quantum-mechanical measurement theory, it is
necessary to treat the measuring apparatus quantum mechanically. Thus
new problems arise. To clarify this point, let us look again at a Stern-Gerlach
apparatus oriented in the z direction. Now, the passage of a particle is to be
registered by a detector described by a wave function. The three possible states
of the detector are $\chi_0$ before registering the particle, $\chi_+$ if $s_z = +\frac{1}{2}\hbar$, and $\chi_-$
if $s_z = -\frac{1}{2}\hbar$. First we consider the case of the incoming particle being in an
eigenstate of $\hat{s}_z$. If the particle’s spin is described by $Z_+$, the wave function of the
total system, consisting of particle and detector, is given by

$$\Psi_0 = Z_+ \chi_0 . \qquad (17.20)$$

After the particle has moved through the apparatus, the total wave function is

$$\Psi_e = Z_+ \chi_+ . \qquad (17.21)$$

If the state of the incoming particle is $Z_-$, then

$$\Psi_0 = Z_- \chi_0 . \qquad (17.22)$$

The state of the system after the experiment follows as

$$\Psi_e = Z_- \chi_- . \qquad (17.23)$$


Now we look at the case of the particle not being in an eigenstate of $\hat{s}_z$. For
instance, before the experiment, its wave function is $X_+$. The initial state of the
system is then

$$\Psi_0 = X_+ \chi_0 = \frac{(Z_+ + Z_-)\chi_0}{\sqrt{2}} . \qquad (17.24)$$

The time evolution of the total system is described by the time-dependent
Schrödinger equation with the time-independent Hamiltonian. Since this equation
is linear, it follows that each part of the sum on the right-hand side of (17.24)
develops in time as described by (17.21) and (17.23). Therefore, after the
measurement, the total wave function is

$$\Psi_e = \frac{Z_+ \chi_+ + Z_- \chi_-}{\sqrt{2}} . \qquad (17.25)$$

Some of the problems of a measurement theory are encountered in (17.25). First
we notice that the detector’s state is not $\chi_0$, $\chi_-$, or $\chi_+$, but a combination of
$\chi_+$ and $\chi_-$. If the detector had a pointer capable of staying in three positions
according to the three possible states of the detector, the pointer would be
placed between two of these positions. Thus our experimental experience is
contrary to the assumption that macroscopic objects are described by a wave
function.

A second problem suggested by (17.25) is that it contradicts the postulate of
the reduction of wave functions, which claims that the wave function is in one of the
states $Z_+ \chi_+$ or $Z_- \chi_-$ after the measurement.

A third problem arises in trying to solve the first two: if the system (particle
and detector) is considered as one quantum-mechanical system, wave-function
reduction should take place when the state of the total system is measured. Thus
we should use a second apparatus to measure the state of the detector, and
afterwards the wave function would be reduced to either $Z_- \chi_-$ or $Z_+ \chi_+$. But
that additional experimental apparatus is again part of a larger total system
whose wave function is of the form of (17.25). This procedure could be repeated
arbitrarily often. Obviously, there is no fixed point at which we could determine
that the wave function is reduced.
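
The step from the two eigenstate cases to the superposition uses only the linearity of the time evolution. The sketch below illustrates this bookkeeping under explicit assumptions: total states are stored as dictionaries mapping a pair of labels (spin, detector) to an amplitude, and the interaction is specified only by its action on the two product states with the detector in its initial state. All names are hypothetical.

```python
import math

# Basis labels: spin in {"+", "-"}, detector in {"0", "+", "-"}.
# Assumption: the interaction is known only through (17.21) and (17.23),
# i.e. for a detector that starts in state "0".
TRANSITIONS = {("+", "0"): ("+", "+"), ("-", "0"): ("-", "-")}


def evolve(state):
    """Apply the measurement interaction linearly to a superposition."""
    final = {}
    for basis_state, amp in state.items():
        target = TRANSITIONS[basis_state]
        final[target] = final.get(target, 0.0) + amp
    return final


def detector_probabilities(state):
    """Probability of each detector state, summed over the spin label."""
    probs = {}
    for (_, det), amp in state.items():
        probs[det] = probs.get(det, 0.0) + abs(amp) ** 2
    return probs


def is_product_state(state, tol=1e-12):
    """A state sum c[s][d] |s>|d> factorizes iff all 2x2 minors of c vanish."""
    spins, dets = ["+", "-"], ["0", "+", "-"]
    c = [[state.get((s, d), 0.0) for d in dets] for s in spins]
    for i in range(3):
        for j in range(i + 1, 3):
            if abs(c[0][i] * c[1][j] - c[0][j] * c[1][i]) > tol:
                return False
    return True


A = 1.0 / math.sqrt(2.0)
INITIAL = {("+", "0"): A, ("-", "0"): A}   # spin superposition, detector ready
FINAL = evolve(INITIAL)                    # entangled state after the interaction

if __name__ == "__main__":
    print(FINAL)
    print(detector_probabilities(FINAL))
    print(is_product_state(INITIAL), is_product_state(FINAL))
```

The initial state factorizes into a spin part and a detector part, but the final state does not: the detector cannot be assigned a wave function of its own, which is the first problem noted above.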
17.6 Schrödinger’s Cat


These apparently paradoxical results can be illustrated more vividly by a
Gedankenexperiment conceived by Erwin Schrödinger. We assume that the result of the
previously discussed quantum-mechanical measurement triggers a rifle pointing
at a cat. The total “experimental setup” should be placed in a box to be opened
later on to observe the cat’s state (alive or dead). From that observation,
the result of the measurement can be deduced. In analogy to the previous
discussion, the wave function of the box is $\chi_+$ or $\chi_-$, depending on whether the
cat is alive or dead. The total wave function of the box containing cat, particle
and detector is again given by (17.25). This, of course, would have the consequence
that the state of the cat before opening the box would be neither alive nor
dead. If we reject this absurd conclusion, we have to ask when the wave-function
reduction takes place: When the particle enters the apparatus? When the cat
dies? Or at some other time?

In the following, we present different attempts to formulate a quantum 
theory of measurement, trying to solve all the problems caused by wave-function 
reduction. 


17.7 Subjective Theories 


One theory solving all these problems was developed by E.P. Wigner. All we 
know of the physical world is the information reaching our brains through our 
senses and remaining in our mind. Therefore, Wigner postulated that wave- 
function reduction takes place when the information arrives at our brain. Thus 
particle, detector and cat remain in states described by wave functions of the 
form (17.25) until the box is opened by someone. Then the wave function is 
reduced. 

Although this theory is consistent with all observed results, it is nevertheless 
unsatisfactory for various reasons. First, the theory assumes that the human 
mind is of a different nature than the physical material world, to which the brain 
also belongs. It is difficult to believe this assumption. It should be possible to 
describe the natural world with objective concepts, independent of our existence 
or our interaction with nature. Furthermore, the entire problem is shifted into 
an inaccessible region: if all physical knowledge exists only in the mind, and the 
mind is not an object of physical analysis, all of physics (natural science) loses 
its objective relevance. Finally, it is difficult to explain how minds of different 
persons reach the same conclusions concerning the results of physical experi- 
ments, if we do not admit the existence of an objective physical world. 

Apart from these objections, the subjective theories do give an explanation of 
quantum-mechanical measurement, and some philosophers and natural scien- 
tists think that they offer the best explanation available at this time. 
17.8 Classical Measurements


A completely different hypothesis is based on the postulate that classical 
mechanics may not be just a limit of quantum mechanics, but an independent 
theory, in which measured results are definitely fixed at all times. Detectors and 
cats should comply with physical laws different from those followed by particles 
belonging to the atomic and subatomic parts of the world. The reduction of 
the wave function takes place when particles interact with macroscopic objects. 
Since this interaction cannot be treated quantum mechanically, the problems 
discussed above do not occur. 

This kind of hypothesis has the major disadvantage of introducing two 
different theories describing, in a complicated manner, the behaviour of material 
objects of different magnitude. A second problem is how to determine the 
boundary between macroscopic and microscopic objects, as can again be illus- 
trated with the help of a Stern-Gerlach experiment. As shown in Chap. 1, this 
experiment is usually performed with atoms [and not with much smaller 
(microscopic) objects like electrons]. The possible paths of the atoms may be so 
far apart that they can be treated as classical objects. Furthermore, there is no 
reason why the experiment should not be performed with the spins of consider- 
ably more massive particles, e.g. uranium nuclei. Previously, we noted that the 
wave function stays unchanged if, after the splitting up, the paths of flight are 
reunited, so that there is no way to decide which path a particle has chosen. 
These classical objects should also be led by the guiding field. Furthermore, the 
beam of classical particles should interfere at a double slit, as discussed at the 
beginning of this chapter. 

It is extraordinarily difficult to perform a Stern-Gerlach experiment in the 
manner just described. Both possible paths through the apparatus have to be of 
identical length (up to an accuracy of 107° m) to reconstruct the original spin 
wave function. A real test of the possible existence of a guiding field for clas- 
sical particles has not yet been made. New experiments examining the tunnel- 
ling effect of electrical flow across superconducting connections may prove 
to be such a test. The theory of classical measurement can only be acceptable 
if the classical result of an experiment differs from the quantum-mechanical 
predictions. 


17.9 The Copenhagen Interpretation 


This measurement theory, which has been widely accepted for the longest period 
of time, was developed by Niels Bohr and his colleagues in Copenhagen. Their 
premise is the impossibility of separating the quantum-mechanical system from 
the measuring apparatus. Thus a spin-½ particle approaching an SGz apparatus
has to be considered as a totally different system from a similar particle
approaching an SGx apparatus. The problem of wave-function reduction does
not occur, since the choice of wave functions ($Z_+$ and $Z_-$ or $X_+$ and $X_-$) is
determined by the details of the experimental setup. If we reunite both separated
beams again as discussed above, we change the experimental setup and thus 
also change the corresponding wave functions. This aspect of the Copenhagen 
interpretation is supported by the experimental and theoretical results concern- 
ing systems of correlated particles mentioned above. As we have seen, particle 
pairs behave as a single system with properties that cannot be explained by the 
properties of the individual particles (see e.g. Chap. 15). 

One further idea of the Copenhagen interpretation is that of complementarity.
Some properties (e.g. position and momentum, or the x and z components of
angular momentum) form complementary pairs. From this point of view,
it follows naturally as a fundamental property of nature that every attempt to
determine one variable yields an uncertainty in the complementary variable. An
example is given by polarized light: the question of whether linearly polarized
light is left- or right-circularly polarized is obviously meaningless. The opinion of the
Copenhagen school is that in quantum mechanics, the attempt to determine
position and momentum at the same time is equally meaningless.

The Copenhagen interpretation does not consider the nature of the measuring
apparatus. Although it does solve the problem of wave-function reduction, it
leaves unclear whether macroscopic objects, including measuring apparatuses,
have to be described by wave functions, or whether a totally different theory is
necessary for that task.


17.10 Indelible Recording 


Up to now, our discussion has shown that the difficulties of a quantum theory
of measurement result from a contradiction between the time-dependent
Schrödinger equation [yielding (17.25)] and the reduction postulate; perhaps
the reduction postulate is not compatible with quantum mechanics (the
Schrödinger equation). Now we will examine this contradiction more carefully,
following Belinfante’s⁶ argumentation, which points out the contradiction and
resolves it.

We recall that after measurement the wave function of the total system
(particle and detector) takes, according to quantum mechanics (the time-dependent
Schrödinger equation), the form $\Psi = (Z_+ \chi_+ + Z_- \chi_-)/\sqrt{2}$ [see (17.25)],
whereas, according to the reduction postulate, it should be $Z_+ \chi_+$ or $Z_- \chi_-$.
Clearly both predictions concern the probabilities of the possible results of
a measurement. Therefore they must not be applied to one experiment with
a single system; a large number of experiments has to be performed, either on the
same system or on other identical systems, before predictions can be checked.


6 F.J. Belinfante: Measurement and Time Reversal in Objective Quantum Theory
(Pergamon, Oxford 1978).
Only if the system is already in an eigenstate of the measured operator and
quantum mechanics predicts a certain result, is a single measurement (a single
system) sufficient. A multitude of identical experiments is called an ensemble.

Let us consider an ensemble of SGz measurements of particles with the
wave function $X_+$. After measurement, the ensemble resides, because of the
Schrödinger equation, in a so-called “pure state” with a wave function of the
form (17.25), whereas, according to the reduction postulate, the ensemble should
be in a “mixed state”, i.e. each of the two eigenstates of $\hat{s}_z$ should be populated by
one-half of the particles.

Let us consider how to decide whether the ensemble is in a pure or a mixed
state. For this a second measurement by a person or another apparatus (e.g.
looking at the counting rates of the detectors) is necessary. That measurement,
described by the operator $\hat{Q}$, should follow the reduction postulate (i.e. the
counting rates can be read off unambiguously). If the system is in a pure state,
the expectation value of $\hat{Q}$ follows from (17.25):

$$\langle Q \rangle = \frac{1}{2} \int (Z_+^* \chi_+^* + Z_-^* \chi_-^*)\, \hat{Q}\, (Z_+ \chi_+ + Z_- \chi_-)\, d\tau , \qquad (17.26)$$

where all variables necessary for the specification of particle and apparatus are
contained in the volume element $d\tau$. Multiplication yields

$$\langle Q \rangle = \frac{1}{2} \int Z_+^* \chi_+^* \hat{Q} Z_+ \chi_+ \, d\tau + \frac{1}{2} \int Z_-^* \chi_-^* \hat{Q} Z_- \chi_- \, d\tau + \mathrm{Re} \int Z_+^* \chi_+^* \hat{Q} Z_- \chi_- \, d\tau . \qquad (17.27)$$

Here we have taken into account the Hermiticity of $\hat{Q}$.
To calculate the properties of a mixed state, we have to consider that the
expectation value of $\hat{Q}$ in a mixed state is equal to the average of the expectation
values, which are calculated by separate measurements with the wave functions
$Z_+ \chi_+$ and $Z_- \chi_-$. Since the number of particles is the same in both states, it
holds that

$$\langle Q \rangle = \frac{1}{2} \int Z_+^* \chi_+^* \hat{Q} Z_+ \chi_+ \, d\tau + \frac{1}{2} \int Z_-^* \chi_-^* \hat{Q} Z_- \chi_- \, d\tau . \qquad (17.28)$$

A comparison of (17.27) and (17.28) shows both expectation values to be
identical if

$$Q_{+-} \equiv \int Z_+^* \chi_+^* \hat{Q} Z_- \chi_- \, d\tau = 0 . \qquad (17.29)$$


It can immediately be seen that a similar condition also guarantees that the
probability distributions [not just the averages, as in (17.27) and (17.28)] of the
possible results of the measurement of $Q$ are independent of whether the system
is in a pure or a mixed state. If we can show that expressions of the form $Q_{+-}$
vanish for all physically possible operators $\hat{Q}$, then pure and mixed states
are indistinguishable. Then the reduction postulate would be compatible with
quantum mechanics; it would, in fact, be a consequence of it.

Now we want to consider the conditions under which the two states $\Psi_+ = Z_+ \chi_+$
and $\Psi_- = Z_- \chi_-$ yield a vanishing integral (17.29). The quantity $|Q_{+-}|^2$ can be interpreted
as being proportional to the probability of a transition between the states $Z_+ \chi_+$
and $Z_- \chi_-$, caused by the action of the measurement operator $\hat{Q}$. If $Q_{+-}$
vanishes, a transition between the states is impossible, meaning that the particle
would have changed the state of the detector irreversibly; we could describe it as
an indelible recording of the event. This is just the property we usually ascribe to
a measuring apparatus: it registers the result until it is returned to its initial state
by an external action.
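
The role of the transition element between the two recorded states can be seen in a two-dimensional toy calculation. The sketch below is an illustration under stated assumptions: only the two total states (spin up with detector up, spin down with detector down) are kept as basis vectors, and the measured operator is represented by a hypothetical 2x2 Hermitian matrix `q` whose off-diagonal element plays the role of the transition element.

```python
def expectation_pure(q):
    """<Q> in the equal-weight superposition of the two basis states."""
    amp = [2 ** -0.5, 2 ** -0.5]
    total = 0.0
    for i in range(2):
        for j in range(2):
            total += amp[i] * q[i][j] * amp[j]
    return complex(total).real


def expectation_mixed(q):
    """<Q> in the mixture with one-half of the ensemble in each basis state."""
    return 0.5 * complex(q[0][0]).real + 0.5 * complex(q[1][1]).real


# Hypothetical operator with a vanishing transition element (indelible recording).
Q_RECORDED = [[1.0, 0.0],
              [0.0, -1.0]]

# Hypothetical operator that connects the two states (recording can be undone).
Q_ERASING = [[1.0, 0.3 + 0.2j],
             [0.3 - 0.2j, -1.0]]

if __name__ == "__main__":
    print(expectation_pure(Q_RECORDED), expectation_mixed(Q_RECORDED))
    print(expectation_pure(Q_ERASING), expectation_mixed(Q_ERASING))
```

When the off-diagonal element vanishes, the pure and the mixed ensemble give the same expectation value; otherwise they differ by the real part of that element, so the two descriptions could in principle be told apart.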

So we see that the reduction of the wave function is induced by the 
measuring process itself, namely, by the recording of the result: it is not induced 
by human perception or the classical behaviour of macroscopic objects.
Furthermore, we see that reduction is not absolutely necessary; after an indelible 
recording has been made, the total system may be described either by a pure 
form (17.25) or a mixed form consisting of reduced wave functions. Nevertheless, 
it is more useful to choose reduced wave functions, since the results of further 
measurements of the particle can be calculated without knowledge of the 
complicated details of the first apparatus. Reduction enables us to describe 
an isolated physical system without considering the other systems which have 
interacted irreversibly with it; we could do so, but this is unnecessary, since we 
know that a description with pure states yields the same results. 

One objection to this argument is the fact that no process is totally irrevers- 
ible: there is always a small but finite probability, for instance, of the detector 
changing its state. There are two possibilities: either the apparatus remains 
isolated, and the change takes place by chance, or the apparatus is manipulated 
by an external influence (e.g. by a person). In the first case, the erasure of the 
recording means that the measurement — and thus the reduction — has not taken 
place, as discussed above. In the second case, we observe that the assumption of 
a pure state can, in principle, only be made for a wave function describing 
everything that interacts with the apparatus. Although the experimental result 
was erased, the information on the detector’s state caused an irreversible change 
in the interacting system (possibly including a human being). As an extreme case, 
the total universe may be considered one system. But if we assume that the 
probability of the reversal of an “irreversible” change is finite, we contradict 
ourselves, because in a universe occupying a state already occupied some time 
ago, we could not find out whether time has passed or not. Here we assume 
(contrary to subjective theories) that our minds are a part of the physical 
universe. 

The principle of indelible recording combined with the ideas of the Copen- 
hagen interpretation yields an objective and economical quantum theory of 
the measurement process. Nevertheless, this theory is not universally accepted. 
Some scientists think that the difficulty in determining when an indelible 
recording is made cannot be solved by the argument briefly described here. 


17.11 The Splitting Universe 


We end our survey of measurement theories with H. Everett’s concepts based on
the idea of an irreversible change in the universe. He assumes that the universe
does not end in one of the various possible states as a result of a measurement,
but that all possible results really take place. Accordingly, the universe must
split up into a number of different noninteracting — and thus noncommuni- 
cating — universes. So if we observe an apparently random result of a quantum- 
mechanical experiment, all results will have taken place in a totally deter- 
ministic manner. We observe a special result in one branch of the splitting 
universe but other “versions” of our existence observe different results in 
different branches. Of course, the splitting of the universe is not restricted to the 
cases in which a measurement is taking place, but occurs for each quantum 
event. Thus the universe has to be thought of as continuously splitting. 

The splitting-universe model treats the problem of wave-function reduction 
in a manner similar to that of the indelible-recording concept. Integrals like 
(17.29) vanish, because now the states $Z_+ \chi_+$ and $Z_- \chi_-$ belong to different
universes not interacting with each other a priori. In fact, many of the ideas in 
the last section were first developed in connection with the splitting universe and 
then used by Belinfante for his model. 

The concept of a splitting universe is naturally uneconomical, and the idea of 
an infinite number of universes can never be proved, since individual universes 
do not interact. This model is thus accepted by only a small number of scientists, 
although its clearly deterministic character is attractive. 


17.12 The Problem of Reality 


A fundamental question of any theory of the physical world deals with the 
nature of that which really exists. This problem is particularly acute in quantum 
mechanics, but a complete discussion of this philosophical topic would exceed 
the range of this book. Therefore we restrict ourselves to a short introduction to 
the basic ideas involved. 

For quantum mechanics, the wave function has no direct physical meaning, 
but is a theoretical construction that can be used to derive the probabilities of 
later events. On the other hand, we have seen in this chapter that the assumption 
of additional properties of quantum systems (namely, hidden variables), which 
are not contained in the wave function, leads to results contrary to experiment.
If the wave function is not physical, and there are no hidden variables, what, 
in fact, does exist? 

One possible answer to this question is again offered by subjectivism: all we 
know to exist is that which we perceive through our senses; therefore only our
perceptions are real. Another answer is given by positivism, which claims that the question
we are posing is senseless, since the existence or nonexistence of objects which 
we do not perceive cannot be verified. Assuming that we are part of a gigantic 
brain (computer) where all objects of the universe, their motion etc. exist as 
fabrications (as computer games), it would not be possible to distinguish be- 
tween that brain and a real world. 

The basis of an objective theory of reality could be given by the concept of
indelible recording. These recordings, or the irreversible changes in the universe,
really exist; reversible changes cannot be observed, and therefore
statements concerning their existence are pointless. If we call these reversible
changes unobservable, we are not necessarily referring to the influence of
a human observer: such reversible “events” just have no influence on the later
development of the universe. These unimportant phenomena we identify with
the total universe regaining a state that already occurred during its evolution, or
with the “everyday” example of a spin-½ particle travelling through one channel
of a Stern-Gerlach experiment without having its path recorded. Thus quantum
mechanics is a theoretical construction, enabling us to predict sequential, 
irreversible events in the universe. Although we speak of wave functions and 
particles, only their irreversible effects can be considered objectively existent. 

Finally, we again stress the point that the purpose of this chapter was not to 
solve the conceptual problems of quantum mechanics, but to show that such 
problems exist, and that a variety of possible, often amusing answers exist as 
well. Quantum mechanics has been extraordinarily successful in predicting 
quantities such as energy spectra, transition probabilities, cross sections etc.; 
nevertheless, the consideration of problems concerning the philosophy of 
science can enrich our thinking and provide additional insight into our world 
and, perhaps, even ourselves. 